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^ Abstract 

2P We present several new results regarding Xg (n) , the maximum length of a Davenport- 

Schinzel sequence of order s on n distinct symbols. 
0> First, we prove that Xs{n) < n ■ 2(Vt!)«(")*+o(a(n)*-i) for s > 4 even, and 

, , Xs{n) <n- 2(V*!)«(")* log, a(n)+o(a(n)*) for s > 3 odd, where t = [{s - 2)/2j, and 

a{n) denotes the inverse Ackermann function. The previous upper bounds, by 
Agarwal, Sharir, and Shor (1989), had a leading coefficient of 1 instead of l/t\ in 
the exponent. The bounds for even s are now tight up to lower-order terms in the 
c/3 exponent. These new bounds result from a small improvement on the technique of 

I ^1 Agarwal et al. 

More importantly, we also present a new technique for deriving upper bounds 
CO for Xs{n). This new technique is very similar to the one we applied to the problem 

^ of stabbing interval chains (Alon et al., 2008). With this new technique we: (1) 

re-derive the upper bound of A3(n) < 2na{n) + 0[n^/a{n)) (ffist shown by Klazar, 
1999); (2) re-derive our own new upper bounds for general s; and (3) obtain im- 
proved upper bounds for the generalized Davenport-Schinzel sequences considered 
by Adamec, Klazar, and Valtr (1992). 

Regarding lower bounds, we show that A3(n) > 2na(n) — 0{n) (the previous 
00 lower bound (Sharir and Agarwal, 1995) had a coefficient of ^), so the coefficient 2 

is tight. We also present a simpler variant of the construction of Agarwal, Sharir, 
> and Shor that achieves the known lower bounds of Xs{n) > n ■ 2(Vt!)«(n)*-o(«(n)*-i) 

for s > 4 even. 
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function. 



1 Introduction 

Given a sequence S, denote by l^l the length of S, and by US'!! the number of distinct 
syml)()ls in S. If ii is another sequence, we write u G S ii S contains a subsequence u' 
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was supported by ISF Grant 155/05 and by the Hermann Minkoswki-MINERVA Center for Geometry 
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(not necessarily contiguous) which is isomorphic to u (i.e., u' can be made equal to u by 
a one-to-one renaming of its symbols). In this case we say that S contains u or that u 
is contained in S. Otherwise, we write u S and we say that S is u-free. For example, 
S = abcdbc contains u = abab, but it is ?;-free for v = abba. 

A sequence S = 010203 ... is called r-sparse if Oj / aj whenever 1 < \j — i\ < r — 1. 
In other words, S is r-sparse if every interval in S of length at most r contains only 
distinct symbols. 

A Davenport-Schinzel sequence of order s, for s > 1, is a sequence that is 2-sparse 
(i.e., contains no adjacent repeated symbols) and is u-free for u = ababab . . . of length 
s + 2. In other words, a Davenport-Schinzel sequence of order s does not contain any 
alternation a . . .b . . . a . . .b . . . of length s + 2 for any pair of symbols a, b. 

Let As(n) denote the maximum length of a Davenport-Schinzel sequence of order s 
on n distinct symbols (As(n) is finite for all s and n). We always take s to be fixed, and 
consider Xs{n) as a function of n. 

These sequences are named after Harold Davenport and Andrzej Schinzel, who first 
studied them in 1965 |5]. The main motivation for Davenport-Schinzel sequences is the 
complexity of the lower envelope of a set of curves in the plane. However, Davenport- 
Schinzel sequences have a large number of applications in computational and combina- 
torial geometry; the book [16] by Sharir and Agarwal is entirely devoted to this topic. 
Given the prominent role these sequences play in computational geometry, it is of great 
interest to derive tight asymptotic bounds for As(n). This goal is quite challenging, given 
the complicated form of the known bounds (see below) . There has been little progress 
in the problem for nearly 20 years. 

The bounds Ai(n) = n (no aba) and X2{n) = 2n — 1 (no abab) are quite easy to 
obtain. But for s > 3 the problem becomes much more complicated — it turns out that 
As(n) is slightly super linear in n. 

Hart and Sharir showed in 1986 [U [16] that Xs^n) = 0(na(n)), where a{n) denotes 
the inverse Ackermann function. (For the upper bound see also Sharir [15] and Klazar 
[8], and for the lower bound see also Wiernik and Sharir [20], Komjath pTO], and Shor 



The lower bound is due to Sharir and Agarwal [16] (based on the construction by 
Wiernik and Sharir [50] ) . The upper bound is due to Klazar . Klazar [U] asks whether 



The current upper and lower bounds for Xs{n) for general s were established by 
Agarwal, Sharir, and Shor in 1989 [2l[T^, and are as follows. Let t = [(s — 2)/2j. Then, 



The tightest known bounds for A3(n) are 
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(1) 



lim„^oo Xs{n)/ (na{n)) exists. 




A.(n) >n-2(i/*!)-(")*-o("(")"') 



s > 4 even. 



(2) 
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For odd s > 5 the asymptotically best lower bounds known are obtained by As(n) > 
As-i(n). 

Sharir and Agarwal's book |T6] contains a complete derivation of the current upper 
and lower bounds for Xs{n) for all s. 

In 2008 the author, together with Alon, Kaplan, Sharir, and Smorodinsky, conjec- 
tured that: 

Conjecture 1.1 ([4 ). The true bounds for Xs{n) are 

where t = [{s - 2)/2j. 

This conjecture is based on some surprisingly similar tight bounds that they obtained 
for an unrelated problem called stabbing interval chains with j -tuples. 



1.1 Generalized Davenport— Schinzel sequences 

Adamec, Klazar, and Valtr |IJ considered a generalization of Davenport-Schinzel se- 
quences, in which the forbidden pattern is not limited to abab . . ., but can be an arbitrary 
sequence. 

Let u (the forbidden pattern) be a sequence with ||n|| = r distinct symbols and length 
|m| = s. Then we denote by Ex„(n) the maximum length of an r-sparse, ?x-free sequence 
on n distinct symbols. The standard Davenport-Schinzel sequences are obtained by 
taking r = 2 and u = abab ... of length s -|- 2. 

The requirement of r-sparsity is necessary, since an (r — l)-sparse, n-free sequence 
can be arbitrarily long. The requirement of r-sparsity, however, ensures that Ex„(n) is 
finite. 

Generalized Davenport-Schinzel sequences have found several applications in dis- 
crete mathematics. Valtr [19] used generalized Davenport-Schinzel sequences to obtain 
bounds for some Turan-type problems for geometric graphs. Alon and Friedgut j3] used 
them to derive an almost-tight upper bound for the so-called Stanley-Wilf conjecture 
(the conjecture was later proved by Marcus and Tardos yjg by a different technique). 
For more information see the surveys by Klazar [9] and by Valtr [19]. More recently, 
Pettie [13] used generalized Davenport-Schinzel sequences to improve Sundar's [18] near- 
linear upper bound for the deque conjecture for splay trees. 



1.2 Formation-free sequences 

Klazar in 1992 [7] developed a general technique for bounding Ext((n) in terms of only 
r = and s = \u\. His technique is based on considering what we call formation-free 
sequences (our name). Given integers r and s, an {r, s)- formation is a sequence of s 
permutations on r symbols. For example, abed dcab dcab cdba dabc is a (4, 5)-formation. 
An {r, s) -formation- free sequence is a sequence which is r-sparse and does not contain 
any (r, s)-formation as a subsequence. 
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Denote by Fr^s{n) the length of the longest possible (r, s)-formation-free sequence on 
n distinct symbols. Let u be a sequence with ||u|| = r and \u\ = s. Since u is trivially 
contained in every (r, s)-formation, it follows that Ex„(n) < Fr^sin). 

Klazar made a slight improvement to this observation, by noting that if r > 2, then 
u is contained in every (r, s — l)-formation, and thus, 

Ex„(n) < Fr^s-i{n) for r > 2. (3) 

(The case r = 1 is not interesting in any case.) Klazar proved the bound 

F,,,(n) <n.20H")^"'), 

where the O notation hides constants that depend on r and s. Together with (|3]), this 
implies that 

Ex„(n) < n-2°H")'"'). 



1.3 Our results 

In this paper we present several new results. 

First, we make a small improvement on the argument of Agarwal et al. [21 [16] and 
prove: 

Theorem 1.2. Let s>3 be fixed, and let t = [{s - 2)/2j . Then 

r^.2(l/t!)-(n)'+0(-(n)-^), , 
~ In • 2(l/t!)"(n)'log2"W+0("{")*)^ g odd 

Thus, the upper bounds for As(n) are now in line with Conjecture and for s even 
they are also tight up to lower-order terms in the exponent. 

More importantly, we also present a new technique for deriving upper bounds for 
Xs{n). Our new technique is based on some recurrences very similar to those used by 
Alon et al. for the problem of stabbing interval chains with j'-tuples. 

With our new technique we re-derive Klazar's upper bound ([T| for X^in), as well 



as our new bounds in Theorem 1.2 for As(n), s > 4. We also apply our technique to 



formation- free sequences, proving that: 
Theorem 1.3. For s > 4 we have 



Fr,s{n) < 



-^.2{iA!Hn)*+o(a(n)-i)^ s odd; 



where t = [(s — 3) /2j . (The O notation hides factors dependent on r and s.) 

As an aside, we improve on Klazar's bound (|3]): 
Lemma 1.4. Let u be a sequence with \\u\\ = r, \u\ = s. Then, Ex„(n) < Fr^s-r+i{n-) . 
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This, together with Theorem 1.3, yields 



Theorem 1.5. Let u be a sequence with \\u\\ = r, \u\ = s, and s > r + 3. Let t 
[(s-r-2)/2j. Then, 



Ex„(n) < 



^ . 2im)a{ny log, a{n)+0(a(n)*) ^ _ ^ 



Note that Theorem 1.5 is a generalization of Theorem 1.2 Taking r = 2 and u 



abab ... of length s + 2 yields the theorem once again. 
Regarding lower bounds, we prove: 

Theorem 1.6. A3(n) > 2na{n) — 0{n). 

Corollary 1.7. lim„^oo A3(n)/(na(n)) =2. 

Finally, we present a simpler variant of the construction of Agarwal, Sharir, and 
Shor ^ liGJ, which achieves the lower bounds ^ for s > 4 even. 

1.4 The Ackermann function and its inverse 

Let us define (our version of) the Ackermann function and its inverse. 

The Ackermann hierarchy is a sequence of functions j4fc(n), for k = 1,2,3, .. . and 
n > 0, where Ai{n) = 2n, and for k >2 we let Afc(n) = A^^}-^{1). (Here /*^") denotes the 
n-fold composition of /.) Alternatively, the definition of A^^n) for k >2 can be written 
recursively as 

w ^ I 1' if n = 0; 

Ak{n) = <^ (4) 
\Ak-i[Ak{n — l)j , otherwise. 

„...2 

We have A2{n) = 2", and A^i^n) = 2 is a "tower" of n twos. Each function in this 
hierarchy grows much faster than the preceding one. Namely, for every fixed k and c we 
have Ak+i{n) > A^j^\n) for all large enough n. 

Notice that Afc(l) = 2 and Ak(2) = 4 for all k, but ^a,.(3) already grows very 
rapidly with k. We define the Ackermann function as A(n) = An(3). Thus, A(n) = 
6, 8, 16, 65536, . . . for n = 1, 2, 3, 4, . . . 

For every fixed k we have A{n) > ^^(n) for all large enough n. It is also easy to 
verify that 

A{n) = An-2 {A{n - 1)) , for n > 3. (5) 



^Klazar himself [7] speculated that it should be possible to achieve roughly Exu(n) < n ■ 2*^^"^"-' ^ ) . 

■^The Ackermann function is usually defined by "diagonalizing" the hierarchy, letting A'{n) = A„{n). 
This does not make any asymptotic difference, since A'{n — 2) < A{n) < A'{n — 1) for n > 5. (There 
are several other definitions of the Ackermann hierarchy and function in the literature, all of which 
exhibit equivalent rates of growth.) We prefer the above definition because, first, "diagonalization" is 
unnecessary, and second, the corresponding definition ^ of a(a;) comes out simpler. For other references 
where a{x) is defined as in (|9| see Pettie [13] and Seidel [14| slide 85]. 
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We then define the slow-growing inverses of these rapidly-growing functions as 



Okix) = min{n | Ak{n) > x}, (6) 
a{x) = min{n | A(n) > x}, (7) 

for all real x > 0. 

Alternatively, and equivalently, we can define these inverse functions directly without 
making reference to and A. We define the inverse Ackermann hierarchy by ai{x) = 
\x/2'] and 

/ N fo, if X < 1; 

afc(x) = < (8) 
I 1 -|- afc(afc_i(x)), otherwise; 

for k > 2. In other words, for each k > 2, afc(x) denotes the number of times we 
must apply ak~i, starting from x, until we reach a value not larger than 1. Thus, 
0^2(2;) = [log2 x] , and a^^x) = log* x. Finally, we define the inverse Ackermann function 

by 

a(x) = mm{k \ afc(x) < 3}. (9) 

It is an easy exercise (only slightly tedious) to prove by induction that the above 
two definitions of and a are exactly equivalent. 



1.5 Organization of this paper 



Sections [2]-[5] contain our upper-bound results. In Section [2] we show how Theorem |1.2 
reduces to bounding a function denoted i^sim, n). In Sectionjsjwe improve the technique 
of Agarwal et al. [21 [16] for bounding 'il)s{m,n). In Section E] we present an alternative 
technique, which yields the same improved bounds for ipsim^n). 

Section [5] addresses formation-free sequences. We first prove Lemma 1.4, and then 



we extend our new technique to formation- free sequences, proving Theorem 1.3 



Sections |6}|7] contain our lower-bound results. Section |6] presents our construction 
for \z{n) that proves Theorem 1.6 Section [7] contains our simplified construction of 
Davenport-Schinzel sequences of even order s > 4. 

Appendices [A}|C] contain some technical calculations. 

For completeness, we provide proofs in this paper of most of the previous results we 
rely on. 



2 Upper bounds for Davenport-Schinzel sequences 

The upper bounds for \s{n) are obtained by considering a function with an additional 
parameter m: 

Definition 2.1: Let m, n, and s be positive integers. Then ips{m,n) denotes the 
maximum length of a Davenport-Schinzel sequence of order s on n distinct symbols 
that can be partitioned into m or fewer contiguous blocks, where each block contains 
only distinct symbols. 
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The relation between As(n) and ipsi^^n) is as follows: 

Lemma 2.2 (OITB]). Let ips-2i'n) be a nondecreasing function in n such that \s-2{n) < 
mp s~2{n) for all n. Then, 

\s{n) < <{>s-2{n) {^s{2n, n) + 2n) . (10) 

Proof. Let be a Davenport-Schinzel sequence of order s on n symbols with maximum 
length \s{n). Partition S greedily from left to right into blocks 5i, . . . , 5"™, such 
that each Si is a sequence of order s — 2; in other words, when scanning 5 from left to 
right, start a new block 5",+! only if an additional symbol would cause Si to contain an 
alternation of length sj^ We claim that m, the number of blocks, is at most 2n. 

Indeed, consider some block Si for i < m. Since Si was extended maximally to the 
right, it must contain an alternation abab ... of length s — 1, which is extended to length 
s by the first symbol of Si+i (which is either a or b, depending on the parity of s). But 
then, we cannot have both b appearing in a previous Sj, j < i, and b or a (depending 
on the parity of s) appearing in a subsequent Sj, j > i, because then 5 would contain 
a forbidden alternation of length s + 2. 

Hence, each block Si (including the last one Sm) contains either the first occurrence 
or the last occurrence of at least one symbol. Thus, m < 2n. 

Let Tii = \\Si\\. Then, 

m m m m 

\s{n) = |5| = ^ \Si\ < ^ Xs-2{ni) < '^nnps-.2{ni) < i/9^_2(n) n^. 

i=l i=l 1=1 1=1 

Let us now bound X^rij. Construct a subsequence S' of S by taking, for each block 
Si, just the first occurrence of each symbol in Si. Note that S' has length \S'\ = 
and, being a subsequence of S, it contains no alternation of length s + 2. Furthermore, 
S' is decomposable into m blocks of distinct symbols S[, . . . ,5*^. However, S' might 
contain adjacent equal symbols at the interface between blocks, but by removing at most 
one symbol from each block S'^, we can obtain a sequence S" with no adjacent equal 
symbols. Therefore, \S"\ < ips{rn,n), and so < Vs("i,n) + m. Since m < 2n, we 
conclude that 

Xs{n) < (fs-2{n) {ips{2n, n) + 2n) . □ 

In particular, for s = 3 we have \z{n) < %lj^{2n,n) + 2n (by taking y?i(n) = 1, since 
Ai(n) = n). Actually for s = 3 we have \^{n) = tp^{2n, n) (Hart and Sharir |6l 116]). 

The main issue, then, is to bound ipsim^n). We present two different techniques for 
bounding %ljs{m,n). The first one is a minor modification of the technique of Agarwal 
et al. [21 [16] . The second one is our new technique. Both techniques yield the following 
bounds: 

^This greedy left-to-right approach is in fact optimal — it yields a partition of S into the minimum 
possible number of blocks of specified order r < s. 
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Lemma 2.3. For s = ?> we have 



ip^{m,n) = 0(kmak{'m) + kn) for all k. 
In general, for every fixed s >3 we have 

ips{m,n) < Cs^k{mak{my~'^ + n) for all k, 



for some constants Cs k of the form 



Cs.k 



'2(iA!)fc'±o(/.-i)^ . even; 



where t = [{s - 2)/2j. 

(Equivalent bounds for ip-i(m,n) and tp4(m,n) were previously derived by Hart and 
Sharir [HI HB], and Agarwal, Sharir, and Shor O [TB], respectively. For s > 5 these are 
improvements over |2l [16], which for s > 6 yield improved bounds for As(n).) 

From Lemmas |2.2| and 2.3 it follows that Xs{n) = o{na£{n)) for every fixed i: Just 
take A: = ^ + 1 in Lemma 2.3 bounding ips-2{n) in Lemma |2.2| by induction. Here the 
magnitude of the constants Cg k is irrelevant. 



But we can also derive a tighter bound for As(n), namely Theorem 1.2, if we let k 
grow very slowly with m; for this the dependence of Cg k on k is significant: 



Proof of Theorem 1.2 Take k = a{m) in Lemma 2.3 (recalling that aci(m){^) ^ 3 by 
definition), and substitute into Lemma 2.2 For s = 3,4 we get X^^n) = 0{na{n)), 
X^in) = 0{n ■ 2"("')) (by taking fi{n) = 1, <^2(«) = 2). For s > 5 we bound As-2("') 
by induction on s, and we substitute the resulting bound for ips-2{n) into (10). We 



obtain the desired bounds (the factor (ps^2{n-) only affects lower-order terms in the 
exponent). □ 



3 Bounding ?/;s(m,n) — improving the known technique 



In this section we prove Lemma 2.3 by making a small improvement on the technique 
of Agarwal et al. [TB] . The main ingredient in the proof is the following complicated- 
looking recurrence relation. This is a small modification of the recurrence in [21 [16] (and 
more complicated). 

Recurrence 3.1. Let m, n > 1 and b < m be integers, and let 

m = mi + 771-2 + • • • + "2-6 

be a partition of m into b nonnegative integers. Then, there exists a partition of n into 
b +1 nonnegative integers 

n = ni + n2-\ Vub + n*, (11) 
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and there exist nonnegative integers n^, 77-2, . . . , < n* satisfying 

nl + n*2 + ---+nl<'il:s{b,n*)+b, (12) 

such that 

b 

ips{m, n) < 2V's-i(m, n*) + 4m + ^ (^V's-2("T'i, n*) + tpsim, rii)^ . (13) 

1=1 

Here it is appropriate to repeat Matousek's advice [12', p. 179] to first study the 
proof below and then try to understand the statement of the recurrence. 

Proof. Let 5 be a maximum-length Davenport-Schinzel sequence of order s that is 
partitionable into m blocks Si, ... , Sm, each of distinct symbols. Thus, |5| = ^s(m, n). 
Group the blocks Si, ... , Sm into b layers Li, L2, . . . , L^, from left to right, by letting 
each layer Lj contain mi consecutive blocks. 

We partition the alphabet of S into two sets of symbols. The local symbols are those 
that appear in only one layer, and the global symbols are those that appear in two or 
more layers. Let rii be the number of symbols local to layer Lj, for 1 < i < 6, and let 



n* be the number of global symbols. Equation (11) follows. 



For each layer Li, let denote the number of global symbols that appear in Lj. 



Trivially n* < n* for all i. To see that ( 12 ) holds, build a subsequence S' of S by taking, 
for each layer Lj and each global symbol a in Lj, just the first occurrence of a within Lj. 
The sequence S', being a subsequence of S, does not contain any alternation of length 
s + 2. Furthermore, S' consists of b blocks of distinct symbols, corresponding to the b 
layers of S. 

However, S' might contain pairs of adjacent equal symbols at the interface between 
blocks. But there are at most 6 — 1 such pairs of symbols, and by deleting one symbol 



from each pair, we finally obtain a Davenport-Schinzel sequence. Bound (12) follows. 

Each occurrence of a global symbol a in a layer Lj is classified into starting, middle, 
or ending, as follows: If a does not appear in any previous layer Lj, j < i, we say that 
a is a starting symbol for Lj. Similarly, if a does not appear in any subsequent layer Lj, 
j > i, then a is an ending symbol for Li. If a appears both before and after Lj, then a 
is a middle symbol for Lj. 

Decompose S into four sequences Ti,T2,Ts,T4 (not necessarily contiguous), as fol- 
lows: Let Ti contain all occurrences of the local symbols of S. Let T2 contain all 
occurrences of the starting global symbols in all the layers of S; similarly, let T3 contain 
all occurrences of the middle global symbols, and let T4 contain all occurrences of the 
ending global symbols in all the layers of S. Thus, |Ti| -|- • • • -|- IT4I = tps{m,n). Each 
sequence Ti, . . . , T4 inherits from S the partition into b layers, in which the i-th layer is 
further partitioned into rrii blocks. 

Each of the sequences Ti, . . . , r4 might contain pairs of adjacent equal symbols, but 
these can only occur at the interface between adjacent blocks. Hence, by removing 
at most m — 1 symbols from each sequence, we obtain sequences T[,...,T^ with no 
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adjacent equal symbols. Thus, 'i/'s("i,n) < \T[\ + • • • + IT4I + 4m. We now bound each 
of |T{|, . . . , IT4I individually. 

Let us first consider T[. The i-th layer in T[ is a Davenport-Schinzel sequence of 
order s on nj symbols, and it consists of rrii blocks, each of distinct symbols. Thus 

b 

\T[\ < '^ips{mi,ni). 

i=l 

Next consider Tg. We claim that each layer in T2 is a Davenport-Schinzel sequence 
of order s — 1. Indeed, suppose for a contradiction that some layer in contains an 
alternation abab . . . of length s + 1. Then, since a and b are starting symbols for this 
layer, they must both appear in S in some subsequent layer, and so S would contain an 
alternation of length s + 2, a contradiction. 

Furthermore, since each global symbol is a starting symbol for exactly one layer, the 
layers in have pairwise disjoint sets of symbols, so all of is a Davenport-Schinzel 
sequence of order s — 1. A similar argument applies for Tjj^. Thus, 

|r^|,|ri| <^,_i(m,n*). 

Finally, consider T3. Each layer in T3 is composed of middle global symbols, which 
appear in S in both previous and subsequent layers. Therefore, no layer in Tg can 
contain an alternation of length s, or else S would contain an alternation of length s + 2. 
Thus, each layer in Tg is a Davenport-Schinzel sequence of order s — 2. (However, the 
whole Tg is not necessarily of order s — 2.) Since the i-th layer in T3 contains n* different 
symbols and is partitioned into rrii blocks, each of distinct symbols, we have 



Bound (13) follows. □ 



Remark 3.2: Our key improvement over the method of Agarwal, Sharir, and Shor lies 
in the bound for \T^\. They noted that each layer in T3 is a sequence of order s — 2, 
but they did not use the fact that the blocks in each layer have distinct symbols. In 
addition, they did not introduce the variables n*. 

3.1 Applying the recurrence relation 



We apply Recurrence |3.1| repeatedly to obtain successively better upper bounds on 
il^sim^n). We first obtain a polylogarithmic bound, and then we use induction to go all 
the way down the inverse Ackermann hierarchy. 

For s > 3 let m-o(s) be a large enough constant (depending only on s) such that 

m > 2 + 2 [log2 m] ''"^ for ah m > mo(s). (14) 
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Define integers Ps.2, Qs,2 for s > 1 by 

Pl,2 = P2,2 = 0, Ql,2 = 1, Q2,2 = 2, (15) 

and, for s > 3, 

P,,2 = 4P,_i,2 + 2P,_2,2 + 2Qs„2,2 + 8, ^^^^ 

Qs,2 = max {mo(s), 2Qs-i,2 + 2(3s-2,2}- 

The reason for our choice of mQ{s) will become apparent later on, in the proof of 
Lemma 3.4 (Also recall that we take s to be a constant, so the growth of Ps,2i Qs,2 in 
s is irrelevant for us.) 

Our poly logarithmic bound is as follows: 

Lemma 3.3. For all m, n, and s, we have 

ips{m, n) < Ps^2 "i(log2 m)*~^ + Qs^2 n. 

Proof. We proceed by induction on s. If s = 1 then tpi^rrijn) < n, and if s = 2 then 
'4'2{'m, n) <2n — 1, and the claim holds. So let s > 3. 

For each s we proceed by induction on m. If m < mo(s) then ips(m,n) < mo(s)n < 
Qs,2 n, and we are done. So assume m > mo{s). 

We apply Recurrence 3.1 with 6 = 2. Let nii = [m/2\ and m2 = [m/2], so 
mi + 1712 = m. Let us bound each term in the right-hand side of ([Ts]) separately. 

The term 2iljs-i{'m.,n*) is bounded, by induction on s, by 

2il)s-i{m, n*) < 2P,_i,2 m(log2 my~^ + 2Qs-i,2 n* . 

Next, we bound the term Yl'i=i '0s-2(^T^^, n*). Using again induction on s, and applying 
log2 rrii < log2 m, we get 

2 

V's-2(?n-j, n*) < Ps-2,2 "l(log2 mY'* + Qs-2,2("-l + n2). 

1=1 



Now, applying (12), we bound nl + loosely by + n2 < 'i/'s(2,n*) + 2 < 2n* + m. 
Thus, being again very loose, we get 

2 

ips-2{'mi, n*) < m(log2 m)''"^(Ps_2,2 + Qs-2,2) + 2(5s-2,2 n* . 

i=l 

Next we bound the term Yld=i''i^s{fni,ni), using induction on m. Applying log2 mj < 
log2 m — ^, which is true for m > 3, and using the fact that (x — \Y~'^ < ,x'*~^ — \x'^~^ 
for all X > 2, we get 

2 2 



Tps{mi,ni) < ^ (Ps,2 mi(log2 rriiY ^ + Qs,2 ^j) 



1=1 i=i 



< Ps,2'm{log2mY ^ -^Ps,2m{log2mY ^ + (5.,2(n - n*). 
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Finally, we bound 4m (very loosely for s > 4) by 4m(log2 mY ^. Putting everything 
together, we get 



ips{m,n) < Ps ^2 m(log2 my ^ + Qs,2n 
+ m(log2 m)^^^ 
+ {2Qs-l,2 + 2Qs-2,2 - Qs,2)n*. 



2i^s-l,2 + Ps-2,2 + Qs-2,2 + 4 — ^^^,2 



By the definition of Ps^2 and Qs,2 in (16), the last two lines are non-positive, so 

ipsi'm, n) < Ps^2 "i(log2 mY""^ + Qs,2 n. 



□ 



We are now ready to go all the way down the inverse Ackermann hierarchy. Define 
integers Ps,k, Qs,k for /c > 3, s > 1 by 



Pi 



l,k 



P 



2,k 



0, 



Ql,k — 1, Q2,k — 2, 



and, for s > 3, 



Ps,k 
Qs,k 



Qs-2,fc(l + Ps,k-l) + 2(is Pg-l^k + d'g Ps-2,k + 4, 
Qs-2,k Qs,k-1 + 2(5s_l,fc, 



(17) 



for some constants ds and d'^ to be specified later, with Ps^2, Qs,2 as in (15), (16). These 



quantities -Ps,fc) Qs,k will give rise to Cg^k of Lemma 2.3 



Lemma 3.4. For every s there exists a constant Cg such that 

ipsim, n) < Ps^k m{ak{m) + CsY~'^ + Qs,k n 
for all integers n, m, s, and k. 



(18) 



The proof is similar to the proof of Lemma 3.3 though more complex, since we 
proceed by induction on k for each s. Before delving into the actual details, we give a 
brief sketch of the proof. For the purposes of this sketch, denote the right-hand side of 



3.1 



(18) by rs_fc(m, n). Now refer to equation (13) in Recurrence 

The proof proceeds as follows. We bound the term ■05_i(m, n*) by r<j_i ^(m, n*). We 
bound the terms ■ips-2{mi,n*) by Ts-2,k{'mi,n*); this produces the term Qs-2,kYl^i ^ 
on which we apply (12). We bound the resulting term ipsibjU*) by Ts.k-iib,n*) (here is 
where we use induction on k). Finally, we bound the terms ^s(mj,nj) by Tg^k{m,i,ni) 
by induction on m (since rrii < m for every i). 



Proof of Lemma 3.4 By induction on s. As before, the claim is easily established for 
s = 1, 2, so assume s > 3 is fixed. 

For each s we proceed by induction on k. If /c = 2 then the claim reduces to 
Lemma 13.31 so assume k> 3. 
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(19) 



By our induction assumption on s, we have 

^ps-l{m, n) < Ps-i,fc m{ak{m) + Cs-if^'^ + Qs-i,k n, 
V's-2(?7i, n) < Ps-2,k m{ak{m) + 0^-2)''"^ + Qs-2,k n, 
for all m and n. 

Here it is convenient to work with a slight variant 3fc(x) of the inverse Ackermann 
hierarchy. Define ak{x) for A; > 2, x > by a2{x) = a2{x) = [log2 x] , and for /c > 3 by 
the recurrence 

^ - - J 1, if X < mo{s); 

1 1 + afc(l + 2S/fc_i(x)*~^) , otherwise; 



ak[x) 



(20) 



with mo(s) as given in ( |14[ ). (Compare (20) to the definition ([8]) of afc(x); our choice of 
mo(s) guarantees that 3fc(x) is well-defined for all k and x.) 

The functions 3fc(x) are almost equivalent to the usual inverse Ackermann functions 
afc(x). In fact, there exists a constant Cg, depending only on s, such that |3fe(x) — 
afc(3;)| ^ Cs for all A; and x. See Appendix B of [1] for a general technique for proving 
bounds of this type (or see Appendix [C] in this paper). 

We will show that 



(21) 



for all n, m, and k. We will do this by induction on k, and for each k by induction on 
m. Then our claim will follow. 

If m < mo(s), then ^s(m,n) < mo{s)n < Qs,2T^ ^ Qs,k^: and we are done. So 
assume m > mo(s). 

We want to translate the bounds (19) into bounds involving 3^. Since afc(?n.) < 
3fc(?T^) + Cs and 3fc(m) > 1, it follows (being somewhat slack) that there exist multi- 
plicative constants dg, d'g such that 

'il)g-i{m, n) < dg Ps-i.fc mak{mY~^ + Q^-i.fc n, 
'4ig-2{m, n) < d'g Ps-2,k mak{mY~^ + Qs-2,k n, 

for all n and m. 



(22) 
(23) 



Assume by induction on k that (21) holds for k — 1. Choose 

m 



Let m,- 



[m/6J or [m/ft] for each i, such that Y2mi = m. We claim that 



rrii <1+ 2ak-i{m) 



s-2 



for ah 1 < i < 6. 



(24) 



(25) 



Indeed, by our choice of mo(s) as given in (14), we have 3fc_i(m) 
m/2 for all m > mo(s). Thus, 



< [log2m" 



s-2 



< 



m m 

< 1 + - < 1 + — — 

b m/ak-iym) 



1-2 . 

s-2 



1 + 



■mak_i{m) 
m — 3fc_i(m)*^2 



1 



< 1 + 



mdik-iijny 2 



m 



m/2 



1 + 23fc_i(m) 



s-2 
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Let us bound each term in the right-hand side of (13). We first bound the term 



2V's-i(m, n*) using (22), and we obtain 

2V's-i(m, n*) < 2ds Ps-i,k mak{my~^ + 2Qs^i^k n* . 



Next we bound '}2ii=i'^s-2{fni,n*) using (23). Observing that ak{rni) < ak{rn), 

b b 

'4>s-2{mi,n*) < ^ (4 Ps-2,k miakimiY'^ + Qs-2,k n*) 



i=l 



i=l 



<d!gPs-2,kmak{my ^ + Qs-2,fc^ra*. 



(26) 



i=l 



Now we apply (12), and we bound %l)s{b,n*) by (21) with /c — 1 in place of k. 



n* < Mb, n*) + b< P,,fc_i bak-iibY~^ + Qs,k-i n* + b. 



i=l 



By our choice of 6 in (24), we have afc_i(6)* ^ < ak-i{rn)^ ^ < m/b, so, being somewhat 
slack. 



^ < < Ps,k-i m + Qs,k-i n* + m< mak{my ^(1 + P^.fc-i) + Qs,k~i 



n . 



i=l 



Substituting this into (26), and being slack again, we get 

6 



^ Vs-2(?n-i, O < makimf ^ {d!^ Ps~2,k + Qs-2,k{'^ + Ps,k^i)) + Qs~2,k Qs,k-i n* . 



i=l 



Next we bound ^j^-,^ '^^(mj, n^), applying (21) by induction on m (since < m): 

b b 

ips{m,ni) < ^ {Ps,k miakimiY"'^ + Qs,k ni) . 



i=l 



1=1 



But by (25) and (20), 



akirrii) < afc(l + 23fc_i(m)'' = ak{m) - 1. 
Further, we have {x — ly^"^ < x^^'^ — x'^~^ for all x > 1. Therefore, 
6 

ips{rni,ni) < Ps,fc m (akimY''^ - afc(m)'*"^) + Qs,k{n - n*). 

1=1 
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Finally, we bound 4m very loosely by 4mafc(m)'^ . Putting everything together, we 



get 



makimy-^ (2^ + < P,_2,fc + Qs~2,k{^ + Ps,k-i) + 4 - Ps,k) 

(2Qs-l,fc + Qs-2,k Qs,k-l — Qs,k)n* . 



By the definition of Ps^k and Qs,k in (17), the last two lines equal zero, and we get 

i>s{m, n) < Ps^k makimY''^ + Qs^k n. 



□ 



All that remains is to analyze the asymptotic growth of Pg^k^ Qs,k in k for fixed s. 
We have 

P3,k,Q3,k = Q{k), P4,fc,Q4,fc = e(2^), 

and, in general, letting t = [(s — 2)/2j , 



-Psjfe ) Qs,k 



'2(i/t!)/=*±o(fc'-)^ s>4even; 
2ii/ti)kHog,k±o{k^) 5>3odd 



(27) 



(see Appendix [B] for the proof). Thus, Lemma 3.4 is equivalent to Lemma 2.3 



Remark 3.5: The investment we made in using a more complicated recurrence (Re- 
currence 3.1 instead of the one used by Agarwal et al. [2] |T6]) paid off in Lemma 3.4 



Besides being tighter, the lemma also has a simpler form. The corresponding bound in 
[21 [16] is of the form 

ipsim, n) < J^s,k{n) ■ mak{m) + Qs,k{n) ■ n, 

where J-s,k{n) and Qs,k{n) are functions of a{n). Our constants Ps.k-, Qs,k, in contrast, 
do not depend on n. 



4 A new technique for bounding iljs{m,n) 

We now present an alternative technique for bounding ips(m,n). Our new technique 
is based on a variant of Davenport-Schinzel sequences, in which we turn the problem 
around, in a sense. We call our variant sequences almost-DS sequences. 

An almost-DS sequence of order s with multiplicity k and m blocks (or an ADS^(m)- 
sequence, for short) is a sequence that satisfies the following properties: 

• It is a concatenation of m blocks, each block containing only distinct symbols. 

• Each symbol appears at least k times (in different blocks, so we must have m > k 
for there to be any symbols at all). 

• The sequence contains no alternation abab ... of length s + 2. 
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Note that we do allow repetitions at the interface between adjacent blocks (this 
simplifies matters). This is why these are almost Davenport-Schinzel sequences. 

We now pose a different problem: We ask for maximizing the number of distinct 
symbols. Let n|(m) denote the maximum number of distinct symbols in an ADS|(m)- 
sequence. (Note that n|(m) = for m < k.) 

The connection between ^s(m,n) and n|(m) is based on the following lemma: 

Lemma 4.1. For all s, n, m, and k we have 7/^5(771,71) < k(ll^{m) + rij. 

Proof. Let S be a maximum-length Davenport-Schinzel sequence of order s onn distinct 
symbols that is partitionable into m blocks, each of distinct symbols. Thus, IS"! = 
ips{m,n). Let /c > 1 be a parameter. 

We transform S into another sequence S' in which every symbol appears exactly k 
times as followsj^ For each symbol a, group the occurrences of a in S from left to right 
into "clusters" of size A;, deleting the last remaining <k — l occurrences of a. Make the 
occurrences of a in different clusters different, by replacing each a in the i-ih cluster by 
a new symbol a-i . 

We deleted at most kn symbols from S, so \S'\ > \S\ — kn. On the other hand, S' 
is clearly an ADS|(?77,)-sequence (the symbol deletions might have created repetitions 
at the interface between blocks, but these are permitted in almost-DS sequences; on 
the other hand, the symbol replacements do not introduce any forbidden alternations). 
Thus, S" contains at most n|(m) distinct symbols. Since each symbol appears exactly 
k times, we have \S'\ < k ■ n|(?TT,). The claim follows. □ 

Thus, the problem of bounding ips{m,n) reduces to bounding n|(?TT,). 

4.1 Bounding the number of symbols in ADS sequences 

We first derive some basic results: For every s > 1, if we take k < s then n^(7n) = 00, 
but if we take k = s + 1 then n|(7n-) is already finite. 

Lemma 4.2. For all s > 1, m > s we have n^(7n) = 00. 

Proof. Take the sequence 

abc cba abc 

with s blocks, with arbitrarily many symbols in each block. Each symbol appears s 
times, and the maximum alternation is of length s + 1. □ 

Lemma 4.3. We have 02(777,) = m — 1. 

Proof. Let S be an ADS2(7r7)-sequence. Since S cannot contain an alternation aba, each 
symbol must have all its occurrences contiguous. Given that S contains m blocks, the 
sequence that maximizes the number of distinct symbols is 

1 12 23 ... (777 - 2)(r77 - 1) (tti - 1), 

with 777 — 1 distinct symbols. □ 
■^A similar argument has been used by Sundar [181 Lemma 9] for a diflterent problem. 
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Lemma 4.4. For all s > 2 we have Ill_^_^{m) < = 0(m* 

Proof. Suppose for a contradiction that there exists an ADSs^i(m)-sequence S with 
n = 1 + {^Zi) distinct symbols. Thus, each symbol appears in at least s + 1 out of 
m different blocks. For each symbol a, consider the s — 1 "internal" occurrences of o, 
meaning, all occurrences except the first and the last. These internal occurrences can 
fall in any of the m — 2 "internal" blocks of S (excluding the first and last blocks). 

By our choice of n, there must be two symbols a, b whose internal occurrences fall in 
the same s — 1 out of m — 2 internal blocks. These internal occurrences create an alterna- 
tion of length at least s (in the best case, they form the subsequence abba ab . . .). Since 
both a and b also appear before and after this subsequence, S contains an alternation 
of length s + 2, a contradiction. □ 

We now bound n|(m) by deriving recurrences and solving them, in a manner almost 
entirely analogous to [4]. We begin with the following recurrence and corollary, which 
are analogous to Lemma 3.2 in [4J: 

Recurrence 4.5. For every s > 3 and every k and m we have 

n^fc_i(2m) < 2ni,_i(m) +2n^-Hm). 

Proof. Given an ADS2;j_x(2m)-sequence S", partition the 2m blocks of S into a "left 
half and a "right half of m blocks each. The symbols of S fall into four categories: 

• Symbols that appear only in the left half. Taking just these symbols produces an 
ADS2fc_]^(m)-sequence, so there are at most n|^_-^(m) such symbols. 

• Symbols that appear only in the right half. There are also at most n2^_j^(m) such 
symbols. 

• Symbols that appear in both halves, but appear at least k times in the left 
half. Taking just these symbols, and just their left-half occurrences, produces 
an -sequence S' . (An alternation abab . . . of length s -|- 1 in S" would 
be extended to length s -|- 2 by an a or 6 that appears in the right half.) Thus, 
there are at most n^~^(m) of these symbols. 

• Symbols that appear in both halves, but appear at least k times in the right half. 
There are also at most n^~^(m) such symbols. □ 

Corollary 4.6. For every fixed s >2, if we let k = 2'^^^ + 1, then 

n|(m) = 0{m{logmy-^) 
(where the constant implicit in the O notation might depend on s). 



Proof. Apply Recurrence 4.5 using induction on s, using Lemma 4.4 as base case for 
s = 2. □ 
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The following recurrence and corollary for n^(m) are analogous to Recurrence 3.3 
and Lemma 3.5 in [4 : 

Recurrence 4.7. Let t be an integer parameter, with t < yjm. Then, 

^l{m) < (i + y ) n|(t) + nts (i + 7) + 3"^- 

Proof. Take a sequence S that maximizes n|(m). Let b = [m/t] < 1 + m/t. Partition 
the m blocks of S from left to right into b layers Li, . . . , Lf, of at most t blocks each. 

We classify the symbols of S into different types. A symbol is local for layer Lj 
if it only appears in L,. Taking just the symbols local to Li produces an ADS|(t)- 
sequence. Therefore, the number of local symbols is at most n|(t) per layer, or at most 
bUlit) < (1 + t) ^lit) altogether. 

Symbols which appear in at least two layers are called global symbols. 

Call a global symbol left- concentrated for layer Li if it makes its first appearance in Li, 
and it appears at least three times in Lj. Given a layer Li, take just the left-concentrated 
symbols for Li, and just their occurrences within Li. The resulting sequence 5^ cannot 
contain an alternation abab, or else S would contain ababa. Therefore, S"- is an ADS|(t)- 
sequence, so by Lemma |4.4| it has at most t — 2 different symbols. Thus, there are at 
most b{t — 2) < (1 + y)!^ — 2) < m left-concentrated symbols altogether (since t < y/m). 

Similarly, there are at most m right- concentrated symbols. 

Next, call a global symbol middle- concentrated for layer Lj if it appears at least twice 
in Li, and it also appears before Li and after Lj. 

Given Li, take just the middle-concentrated symbols for Li, and just their occur- 
rences within Li. The resulting sequence S'- cannot contain an alternation aba, so S'l is 



an ADS2(t)-sequence, and so by Lemma 4.3 it contains at most t — 1 different symbols. 



Therefore, there are at most b{t — 1) <m middle-concentrated symbols. (Note that we 
might have counted the same middle-concentrated symbol more than once.) 

Finally, take all the global symbols we have not accounted for so far — the scattered 
symbols. Each of these symbols must appear in at least k — 2 different layers. Build 
a subsequence of S by taking just the scattered symbols, and for each scattered sym- 
bol, just one occurrence per layer. Each layer becomes a block, and no new forbidden 
alternation can arise. Hence, we get an ADS|_2(^)"Sequence, which can have at most 
n|_2(l + y) different symbols. □ 

Corollary 4.8. There exists an absolute constant c such that, for every k >2, we have 

^2k+ii''^) — cmak{m) for all m. 

Proof. Let niQ be a constant large enough that 

m > 1 -\- 9[log2 m]'^ for all m > mo- 

We will work with a slight variant of the inverse Ackermann function. For this proof, let 
ak{x), k >2,he given by a2(x) = a2{x) = [log2x], and, for A; > 3, by the recurrence 



ak{x) 



1, if X < mo; 

1 -|- QA;(3Qfc-i(a;)), otherwise. 
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Note that ak{x) is well-defined by our choice of ttt-q. Furthermore, there exists a constant 
Co such that |afc(x) — ak{x)\ < cq for all k and x (see Appendix B of or Appendix [C] 
in this paper). 

We will prove by induction on A; > 2 that 

nifc+iC'^) — Cimak{m) for all m, 



for some absolute constant ci; this implies our claim. By Corollary 4.6 we have n|(m) = 
0(m log m), so the base case k = 2 follows by choosing ci sufficiently large. We choose 
ci large enough so that it also satisfies ci > 12 and that ci > Iij{m)/'m for all m < mo. 

Now, let k > 3, and assume the bound holds for k — 1. To establish the bound for 
k, first let m < rriQ. Then we have 

nifc+il"^) ^ 117(772) < cim = cimak{m), 

since Ilf,{m) is nonincreasing in k for fixed m. Thus, let 777 > 7770. We apply Recur- 
with t = 3ak-i{rn) . (Note that t < 1/777 for 777 > 7779 by our choice of 7770.) By 



rence 



4.7 



the induction assumption for k — 1 we have 

„n ( m\ n f 2m\ 2ci 777^ / 2m\ 2c^m ^ , , 2ci777 
ni,_,(l + -) <n3,_, - <_a.., - <— a.„,(777) = — . 



Substituting into Recurrence 4.7 and letting Wi^^^^im) = mg{m), 

/ N ^ /,N , n2fc+i(t) 2ci 

5(777 < g{t) + + ^ + 3 

m 6 



< g{t) H ^ + 4 (since, by Lemma 4.4 I^ik+iif) <t^ <m) 

3 



< g{t) + ci (since ci > 12). 

Since 3fc(t) = 3^(777) — 1, it follows by induction on 777 (with base case m < mo) that 

g{m) < ci3fc(m) for all 777. 

Therefore, 

112(^+1(777) < cimak{iTi) for all m. □ 



The bound for tpa^m, n) in Lemma 2.3 now follows from Corollary 4.8 and Lemma 4.1 



4.2 Obtaining Klazar's improved upper bound for X^ln) 

Klazar's tighter upper bound ([T]) for Xs{n) follows by using the following relation between 



A3(r7) and 7/^3(777,77), instead of Lemma 2.2 



Lemma 4.9 (Klazar |8 ). We have X^^n) < 7/^3(1 + 277/^, 77) + 3r7^, where i < n is a free 
parameter. 
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(Klazar actually proved this relation under a stricter definition of ^3(771, n).) For 
completeness, we prove Lemma |4.9|in Appendix [A) 



Corollary 4.10. A3(n) < 2na{n) + 0[ny^a{n)) . 



Proof. Taking s = 3 and k = 2a{m) + 1 in Lemma 4.1, and bounding ^2a(m)+i^^^ 
Corollary |4.8[ we get 

ip3{m,n) < (2a(m) + 1) (cma^(„)(m) + n) = 2na{m) + n + O (^ma{m)Y 



We now apply Lemma 4.9 with £ = \J a(n). □ 



4.3 Bounding n|(m) for general s 

The following recurrence and corollary for n|.(m) are analogous to Recurrence 3.6 and 
Lemma 3.8 in [3]: 

Recurrence 4.11. Let s > 3 be fixed. Let ki, k2, k^ be integers, and put k = /C2/C3 + 
2ki -3k2-k3 + 2. Then, 



n|(m) <[i + j) [um + 2Ui-/{t) + ni-/it)) + + ^ 

where t is a free parameter. 

Proof. Take a sequence S that maximizes n^(m). Again partition the m blocks of S 
into b = [m/t] < 1 + m/t layers Li, . . . , L^, with at most t blocks per layer. 

We again classify the symbols of S into local (if the symbol appears in only one 
layer), or global. As before, there are at most (l + n|(t) local symbols. 

And we again classify the global symbols into left-concentrated, right- concentrated, 
middle- concentrated, and scattered. This time we do this as follows: 

A global symbol is left- concentrated for layer Li if its first ki occurrences fall in 
Lj. The overall number of left-concentrated symbols is at most (l + ™) n^~^(t). Right- 
concentrated symbols are defined and handled analogously. 

A global symbol is middle-concentrated for layer Lj if it appears at least k2 times 
in Li, and it also appears before Lj and after Li. There are at most (1 + f) Ul'^it) 
middle-concentrated symbols altogether. 

Finally, a global symbol is scattered if it appears in at least k^ different layers. Taking 
just these symbols, and for each symbol, just one occurrence per layer, we obtain an 
ADS|g(6)-sequence. Thus, there are at most Ilf,^{b) < 0^1^(1 -|- ™) scattered symbols. 

All that remains is to show that we did not miss any global symbol. Suppose a global 
symbol is neither left-, middle-, nor right-concentrated, nor scattered. Then the symbol 
appears at most 2(ki — 1) -|- (^3 — 3){k2 — 1) = k — 1 times in S, a contradiction. □ 



The only significant difference between Recurrence 4.11| above and Recurrence 3.6 
in [3] lies in the formula for k in terms of ki, k2, and ^3. (The formula there is A; = 
k2k3 -\- 2ki — 2k2.) But in both cases we get the same asymptotic behavior: 
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Corollary 4.12. Define Rs{d) for s>l, d>2 by Ri{d) = 2, R2{d) = 3, and fors>3 
by 

Rs{2) = 2'-^ + 1, 

Rs{d) = Rs{d - l)Rs-2{d) + 2Rs-i{d) - 3Rs-2{d) 
- Rs{d-l) + 2, ford>3. 

Then, for every s > 2 and d >2, if k > Rs{d) then 

n|(m) < cmad{m)^^'^ for all m. 

Here c = c(s) is a constant that depends only on s. 

Proof. By induction on s, and on d for each s. (Recall that n^(m) is nonincreasing in 
k for fixed s and m.) The base case s = 2 is given by Lemma |4.4[ For s = 3 we have 



R^{d) = 2d + 1, and the claim is equivalent to Corollary 4.8 Therefore, let s > 4 be 
fixed, and assume the claim holds for s' < s. 

Let mo = mo(s) be a constant large enough so thalj^ 

2 

m > 1 + 12''[log2 rnY for all m > mo. 

We again work with a slight variant of the inverse Ackermann function. For this proof 
define ad{x), c? > 2, by a2{x) = a2{x) = [log2 x] , and for d > 3 by the recurrence 



1, if X < mo; 

1 + 3(i(l2a(i_i(2;)*^^) , otherwise. 



The functions ad{x) are well defined by our choice of mo. And as before, there exists a 
constant cq (depending only on s) such that |Srf(x) — ad{x)\ < cq for all d and x. 

We will show, by induction on d, that there exists a constant ci (depending only on 
s) such that, for all d > 2 and all m, we have 

n|,(m) < cimadimy-'^ for k > Rs{d). (28) 

This is easily seen to imply the claim. 

The base case d = 2 follows from Corollary 4.6 provided ci is chosen large enough. 
Further, by induction on s we know there exist constants C2, C3 (depending on s), such 
that 

n^-^m) < C2mSrf(m)"-=^ for k > Rs-i{d), 
^l~\rn) < C3mad{my-^ for k > Rs-2id), 

for all d > 3 and all m. We choose ci large enough so that it also satisfies ci > 6c2, 
ci > 6C3, and 

ci > n^^^3^(m)/m, for all m < mo. (29) 



^The dependence of mo on s here could be greatly improved with a slightly more careful analysis. 
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Now, let d > 3, and suppose (28) holds for d — 1. To establish (28) for d, assume 
first that m < rriQ. Then, by (29), for all k > Rs{d) we have 

n|(m) < n^^^3^(m) < cim = cimad{'m)^~'^ . 

Thus, let m > mo- Apply Recurrence |4 . 1 1 1 with the following parameters: 

ki = Rs-iid), k2 = Rs-2{d), k3 = Rsid - 1), 
k = Rsid), t = 12arf_i(m)'-2. 

The last three terms in Recurrence 14.111 can be bounded as follows. 

2Ul-;\t) < 2c2tad{ty~' < |tS,(m)^-3, 



ci 



\s-3 



m 



< n 



2m 



< 



6 

2cim 



—ad-i{m) = —m < —mad{m) 



Substituting into Recurrence |4.11 we get 



n|(m) < + ^-:^ma'{mr~' +UUt) + ^tadim] 



2ci 



ci 



s-3 



But by Lemma 4.4 we have n^(t) <t^^< (l2[log2 mY ^ , which is at most m for 



m > mo by our choice of mo. In turn, m is at most cim/6, since ci > 6. 

Similarly, we have Y^^dC"^)*"^ < cim/6 for m > mo by our choice of mo. Thus, 



m. 



nfc(m) < —Ul{t) + cima{m 



Letting n|(m) = mg{m), we get 

5(m) < + ciad(m)"-3 
Since Sc;(t) = 3d(m) — 1, it follows by induction on m that 

5((m) < ciSd(m)'^^^ for all m. 



(The base case m < mo follows from (29), and for the induction on m we apply (3d(m) — 
l)^ ^ < (3(i(m) — l)arf(m)''~^.) Therefore, 

n|(m) < cimad{'m)^~'^ for all m. □ 

Let us now study the asymptotic growth of Rs{d) for fixed s. We have 

R3{d) = 2d + 1, R^{d) = 5-2'^ -M-3. 

In general, letting t = [{s — 2)/2j, we have 

^2(iA!)<i*±0(d*-i)^ seven; 



Rsid) 



(30) 



(see Appendix 



Lemma 2.3 



B] again) . 

now follows from Lemma 4.1 by applying Corollary 4.12 with k = Rs{d). 
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5 Bounding format ion- free sequences 



We now deal with the generahzations of Davenport-Schinzel sequences described in the 



Introduction. Recall that the first step in bounding Ex„(?7-) is Lemma 1.4 which claims 
that Extt(n) < Fr^s-r+i{n), where r = \\u\\ and s = \u\. 



Proof of Lemma I.4 Suppose u = U1U2 ■ . .u^, where 1 < Ui < r for each i. We can 
assume that the symbols in u make their first appearances in the order 1, 2, . . . , r. 

Let s' = s — r + 1, and let I = iii2 ' ' ' ^s' be an arbitrary (r, s')-formation, where 
each ij is a permutation of {1, . . . , r}. We want to show that u C £. 

Define a partition u = B1B2 ■ ■ ■ Bg' of u into s' blocks as follows: First let each 
symbol of u constitute its own block of length 1. Then, for each 2 < j < r, merge 
the block that contains the first occurrence of j in u with the block containing the 
immediately preceding symbol. The number of blocks goes down from s to s'. 

Here is an example of a sequence thus partitioned: 

« = [1][1][12][134][2][4][1][25][5]. (31) 

Clearly, each block Bj is an increasing sequence. 

Now we are going to define a permutation o" on {1, . . . , r} such that, for each block 
Bj with 1 < J < s', its image cr{Bj) is a subsequence of ij. We do this by examining the 
blocks from right to left, and by defining a in the order a{r),a{r — 1), ... , o"(l). Note 
that blocks of length 1 can be safely ignored. 

Suppose we have already dealt with blocks Bgi, Bg/^i, . . . , -Bj+i, and that now is the 
turn of block Bj, where \Bj\ > 1. Let k be the last symbol in Bj. The symbols preceding 
k in Bj are A; — 1, /c — 2, . . ., up to the second symbol of Bj. All these symbols make 
their first appearance in u in Bj. Call these the "new" symbols of Bj. 

Suppose we have already assigned values to a{k + 1), . . . ,(T{r) in such a way that, 
no matter how we assign cr(l), . . . , a{k), the images a{Bj^i), . . . , a^Bgi) will always be 
subsequences of ij+i, ... ,£5', respectively. 

Now consider the symbols of ij. Call a symbol of ij "free" if it has not yet been 
assigned as image a{i) to any symbol i, for k + 1 < i < r. 

We scan ij form right to left, considering only its free symbols, and we assign in a 
greedy fashion these free symbols as images cr^k), a{k — 1), . . . to k,k — 1, . . . (the "new" 
symbols of Bj ) . 

After we are done with these assignments, the only symbol of Bj which has not been 
assigned an image is the first symbol of Bj — call it bj. But no matter how we define 
(7{bj) later on, we will always have that a{Bj) is a subsequence of ij (because of our 
greedy approach). 

At the end, the assignment cr{l) of 1 will be forced. 



For example, with u is as in (31), suppose that 



?i £2 32514 35421 4 4 £7 35142 
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(where ii, £2, £5, isi ^Tj ^9 do not matter). Then, our algorithm will assign (t(5) = 2, 
cr(4) = 1, £7(3) = 4, cj(2) = 5, and finally a{l) = 3. Then the sequence 



a(n) = [3] [3] [35] [341] [5] [1] [3] [52] [2] 
is a subsequence of £, as desired. □ 



Remark 5.1: Lemma 1.4 is not the last word in finding sequences in formations. For 
example, consider the sequence u = abcabca. Lemma |1.4| states that u is contained 
in every (3, 5)-formation, but in fact u is contained in every (3, 4)-formation: Let i = 
^i^2^3^4 be a (3, 4)-formation. Suppose ii = abc. Then, if u itself is not a subsequence 
of £, then £2 must have b before o, £3 must have c before b, and £4 must have a before 
c. But then £ contains the subsequence cbacbac. 

5.1 Bounding the length of formation- free sequences 

Thus, the problem of bounding Exu(n) reduces to that of bounding i*V,s(n). For com- 
pleteness, we start by reproducing some simple bounds from [7]. We first prove that 
Fr^s{n) is finite. 

Lemma 5.2 (Klazar [7]). We have Er,s{n) < srf for n > r. 

Proof. Let S be an (r, s)-formation-free sequence on n distinct symbols. Partition 5 
from left to right into blocks of length r. Note that each block contains r distinct 
symbols. Suppose we had 1 + (s — 1)(") complete blocks. Then, by the pigeonhole 
principle, there would exist s blocks that have the same set of r symbols. Such a set of 
s blocks would be an (r, s)-formation. Contradiction. 
Therefore, we must have 

\S\<r(l + {s-l)('^]] <rs(''] <sn'-. □ 



It is also easy to get linear bounds for -Fr,2('i-) and Fr^s{n): 

Lemma 5.3 (Klazar [^J). We have Fr^2in) < rn and F,.^3(n) < 2rn. 

Proof. Let S be an r-sparse sequence on n distinct symbols. Again partition S from left 
to right into blocks of length r (the last block might be shorter) . 

If S contains no (r, 2)-formation then every block must contain the first occurrence 
of a symbol, and if S contains no (r, 3)-formation, then every block must contain the 
first or last occurrence of a symbol. Thus, there are at most n blocks in the first case, 
and at most 2n blocks in the second case. □ 

Lemma 5.4 (Klazar [7]). Let S = S1S2 ■ ■ ■ Sm be a sequence which is a concatenation 
of m blocks, where each block Si contains only distinct symbols. Then S can be made 
r-sparse by deleting at most (r — l)(m — 1) symbols. 
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Proof. Build an r-sparse subsequence S' of 5 in a greedy fashion, by scanning S from 
left to right and adding a symbol from S to S' only if it does not equal any of the last 
r — 1 symbols currently in S' . In this way, we will skip at most r — 1 symbols of each 
block Si, except for the first block 5i, which we will take entirely. □ 



Next, we make a definition analogous to Definition 2.1 



Definition 5.5: Given integers r, s, m, and n, we denote by ip' j.^g{m,n) the length of 
the longest r-sparse, (r, s) -formation- free sequence on n distinct symbols that can be 
partitioned into m or fewer blocks, each block containing only distinct symbols. 

Remark 5.6: The reader need not be intimidated (more than necessary) by the double 
subscript r, s in n). We are never going to use induction on r, only on s. Thus, 

r can be assumed to be fixed throughout our analysis. 



The following lemma (analogous to Lemma 2.2) relates Fr^s{n) to ip' g{m,n). 

Lemma 5.7. Given fixed integers r and s, let ipr,s-2in) be a nondecreasing function of 
n such that Fr^s-2{n) < nipr.s-2{n) for all n. Then, 

Fr,s{n) <2n + (pr,s-2{n) (2(r - l)n -|- V'V,s(2n, n)) . 

(This constitutes a minor improvement over Klazar [7], since Klazar related Fr^s{n-) 
to ipr,s~i{n).) 

Proof. Let S be a maximum-length (r, s)-formation-free sequence on n symbols. Thus, 
IS"! = Fr^sin). Partition S from left to right into subsequences as follows: 

Let Si be the longest prefix of S that is (r, s — 2)-formation-free. Let xi be the 
symbol following Si in 5. Thus SiXi contains an (r, s — 2)-formation. Let S2 be the 
longest segment of S after xi which is (r, s — 2)-formation-free, let X2 be the symbol 
following 52 in S, and so on. 

We obtain a partition S = S1X1S2X2 . . . Xm-iSm[xm], where each Si is a subsequence 
and each Xi IS 3j symbol {xm might or might not be present). 

Each subsequence SiXi must contain either the first or the last occurrence of some 
symbol, for otherwise S would contain an (r, s)-formation. Thus, m < 2n. 

Let rii = \\Si\\. Then 

m m 

j| < m -I- ^ Fr^s-2{ni) 
i=i i=i 

m m 
< 2n ^ niipr,s-2{ni) <2n + ipr,s~2{n) ^ 



rii. 

i=l 1=1 



So we just have to bound ^ rii. Construct a subsequence S' of S by taking, for each 
subsequence Si in the above partition of 5, just the first occurrence of each symbol in 
Si. Thus, \S'\ = "^rii. Next, using Lemma 5.4, "r-sparsify" S' and obtain a sequence 
5" with |5"| > \S'\ - (r - l)(m - 1). 
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Since S" is a subsequence of S, it contains no (r, s)-formation. Further, S" is r-sparse 
and partitionable into m blocks of distinct symbols. Therefore, 15"'! < V''r,s("^' 
so 

m 

^ni = \S'\ < (r - l)m + \S"\ < 2(r - l)n + VV,5(2n, n). 

i=l 

The claim follows. □ 

We now apply our "almost-DS" technique to formation-free sequences. For this, we 
introduce and analyze "almost-formation-free" sequences. The analysis closely parallels 
the analysis of almost-DS sequences. 



5.2 Almost-formation-free sequences 

If S is a sequence, we say that S is an AFFr^s,k{'m) sequence if S contains no (r, s)- 
formation, can be partitioned into m of fewer blocks, each composed of distinct symbols, 
and each symbol appears at least k times (in k different blocks). 

Note that we do not require r-sparsity; this is the reason for calling S "almost" 
formation-free. 

Let nV,s,fc(m) be the maximum possible number of distinct symbols in an AFFr,5,fc(m) 
sequence. 

We first show the connection between AFF sequences and ip' ^^g{m,n), and then we 
derive upper bounds for n'r s fc(m). 

Lemma 5.8. For all s >2 and k we have il}'^^g{m,n) < A;(n'j.,s,A;(m) -|- n). 

Proof. Let 5 be a maximum-length r-sparse, (r, s)-formation-free sequence on n distinct 
symbols, partitionable into m blocks. Thus, l^l = tp' ^ g{m,n). Let > 1 be the specified 
parameter. 

Transform S into another sequence S' in which each symbol appears exactly k times 

as follows. For each symbol a, group the occurrences of a from left to right into "clusters" 
of size /c, discarding the < k occurrences left at the end. Replace each a in the i-th cluster 
by a new symbol Oj. 

If s > 2, then this does not introduce any (r, s)-formations. (Proof: Call two symbols 

a and b disjoint if every occurrence of a lies before every occurrence of h or vice-versa. 
Note that if a and h are disjoint, they cannot belong to the same (r, s)-formation for 
s > 2. Thus, if S' contains an (r, s)-formation, that formation was already present in 
S.) 

We deleted at most kn symbols from S, and the result S' is an AFFr,s,fe("^) sequence 
(S" is not necessarily r-sparse, but this is fine for an AFF sequence). Therefore, S' 
contains at most li' r,s,k{^) symbols, each one appearing exactly k times, so it has 
length at most k ■ nV,s,fc('Ti). The claim follows. □ 

Lemma 5.9. For every r >2 we have U.'r,2,2{rn) = (r — l)(m — 1). 
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Proof. For the upper bound, consider m — 1 "separators" between the m blocks. We say 
that a symbol a "contributes" to all the separators between the first two occurrences of 
a. Thus each symbol contributes to at least one separator. If there were l + (r — l)(m — 1) 
symbols, then there would exist a separator with at least r contributions, which would 
lead to the existence of an (r, 2)-formation. 

For the lower bound, create m blocks, and create n = (r — l)(m — 1) different symbols 
partitioned into m — 1 sets Ai, . . . , Am-i of r — 1 symbols each. Make two copies of each 
Ai, and put one copy at the end of block i and one copy at the beginning of block i + 
We get a sequence with the desired properties. □ 

Lemma 5.10. For every fixed r > 2 and s > 3 we have Il'r,s,s{n^) < — ^){"^l2) ~ 

Proof. Suppose for a contradiction that there is an AFFr,s,s(m) sequence with 1 + (r — 
1) ('^~2^) distinct symbols. Consider the s — 2 middle occurrences of each symbol. They 
fall on s — 2 out of m — 2 different blocks. Therefore, by the pigeonhole principle, there 
exist r symbols whose s — 2 middle occurrences all fall in the same s — 2 blocks. This 
leads to the existence of an (r, s)-formation in the given sequence. Contradiction. □ 

Recurrence 5.11. We have 

nV,s,2fc-i(2m) < 2Il'r^s,2k-i{rn) + 2nV,s-i,fe(H- 



The proof is exactly parallel to that of Recurrence 4.5 



Corollary 5.12. For fixed r > 2 and s > 3, if we let k = 2^ ^ + 1, then 

nV,,,fc(m) =0(m(logm)^"3) 

(where the constant implicit in the O notation might depend on r and s). 

Recurrence 5.13. Let r > 2 and s > 3 be fixed. Let ki, k2, k^, and k he integers 
satisfying /c = A;2^3 + 2ki — 3k2 — k^ + 2. Then, 



where t is a free parameter. 



The proof exactly parallels that of Recurrence 4.11 The corollary is almost the same 
as Corollary 4.12[ there is just a shift of 1 in the index s: 



Corollary 5.14. Let Rs{d) be the sequences defined in Corollary 4-13 Then, for every 
s>3 and d>2, if k > Rs-i{d) then 

n'r-,s,fc(w-) < cmad{jny~^ for all m > k. 

Here, c = c(r, s) is a constant that depends only on r and s. 

Combining Corollary |5.14 with Lemma 5.8 we obtain: 
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Corollary 5.15. Let s > 4. Then, for all r, m, and n we have 

il)' ^ g{rn, n) < Cr^s,d{innad{rny~^ + n) for all d, 
for some constants Cr,s,d of the form 



where t = [{s - 3)/2j. 

We can finally prove our upper bounds for Fr^s{n). 



Proof of Theorem 1.3 Take d = a(m) in Corollary 5.15 then substitute into Lemma 5.7 
bounding ipr,s~2in) by induction on s. Use the base cases Fr^2{n), Fr^3{n) = 0{n) (by 
Lemma 5.3). (As before, '~pr^s-2{n) contributes only to lower-order terms in the expo- 
nent.) □ 



6 The lower bound construction for s = 3 

The rest of this paper deals with lower bounds for Davenport-Schinzel sequences. In 



this section we prove Theorem 1.6 by constructing, for every n, a Davenport-Schinzel 
sequence of order 3 on n distinct symbols with length at least 2na{n) — 0{n). 

For this purpose, we first define a two-dimensional array of sequences Zd(m), for 
d,m>l, with the following properties: 

• Each symbol in Zd{m) appears exactly 2d + 1 times. 

• Zd{m) contains no forbidden alternation ababa. (We do not preclude the presence 
of adjacent repeated symbols in Z(i{m).) 

• Z(i{m) is partitioned into blocks, where each block contains only distinct symbols. 
Some of the blocks in Zd^m) are special blocks. Each symbol in Zd{m) makes its 
first and last occurrences in special blocks. Furthermore, the special blocks are 
entirely composed of first and last occurrences of symbols (there might be both 
first and last occurrences in the same special block). Moreover, each special block 
in Zci{m) has length exactly m. 

• For d > 2, each special block is surrounded by regular blocks on both sides, and 
no regular block is surrounded by special blocks on both sides. For the former 
property, we place empty regular blocks at the beginning and end of Zrf(m), for 
d>2. 

In what follows, we enclose regular blocks by ()'s, and special blocks by []'s. 
The base cases of the construction are as follows: For d = 1, we let 

Zi{m) = [12...m](m...21)[12...m]. 
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Zi(m) contains three blocks of length m; the first and last ones are special blocks. 
Note that each symbol appears exactly three times, as required. Also note that Zi{m) 
contains no alternation ahaba. 
For m = 1 and d > 2 we let 

Z,(l) = ()[l](l)(l)...(l)[l](), 

with 2d+l ones. Each symbol constitutes its own block; the first and the last nonempty 
blocks are special. Note that these special blocks have length 1, as required. At the 
beginning and end there are regular blocks of length zero. 
Denote by Sdim) the number of special blocks in Z^{m). 

The recursive construction. For d,m >2, we construct Zd{m) recursively as fol- 
lows. Let Z' = Z(i{m — 1). Let / = Sd{m — 1) be the number of special blocks in Z', 
and let Z* = Zd-i{f). Thus, the special blocks in Z* have length /. Let g = Sd-i{f) 
be the number of special blocks in Z* . 

Create g copies of Z', each copy using "fresh" symbols which do not occur in Z* nor 
in any preceding copy of Z' . Thus, we have one copy of Z' for each special block in Z* . 
Furthermore, each special block in Z* has as many symbols as there are special blocks 
in the corresponding copy of Z' . 

Let Ci be the i-th. special block in Z* , and let Z- be the i-th copy of Z' . Let a be 
the ^-th symbol in Ci, and let Di be the ^-th special block in Z'-. We duplicate a into 
aa, and we insert the oo into Z'- as follows: 

If the a in Ci is the first a in Z* , then the first of the two a's falls at the end of D£ 
and the second a falls at the beginning of the block after D^. Otherwise, if the a in Ci 
is the last a in Z* , then the first of the two a's falls at the end of the block before De 
and the second a falls at the beginning of Di. (Recall that is surrounded by regular 
blocks in Z[.) 

Since no regular block in Z'^ is surrounded by special blocks on both sides, it follows 
that no block in Z- receives more than one symbol from Z*. Thus, even after the 
insertions, no block in Z'- has repeated symbols. 

After these insertions, at the place in Z* where the block Ci used to be there is now 
a hole. We insert Z- (with its extra symbols) into this hole. After doing this for all 
special blocks Ci in Z*, we obtain the desired sequence Zd{m). See Figure [!} 

It is easy to check that every symbol in Zdi^m) has multiplicity 2d + 1: The symbols 
of the copies of Z' already had multiplicity 2d + 1, and the symbols of Z* had their 
multiplicity increased from 2d — 1 to 2d + 1. 

It is also clear that each symbol makes its first and last occurrences in special blocks, 
that the special blocks in Zd{'m) contain only first and last occurrences, and that their 
length increased from m — 1 to m. Furthermore, every special block is surrounded by 
regular blocks on both sides, and no regular block is surrounded by special blocks on 
both sides. And Zd^m) contains empty regular blocks at the beginning and at the end. 

No ahaba. Let us now verify that Z(i{m) contains no alternation ahaba of length 5. 
Assume by induction that this is true for the component sequences Z' and Z* . 
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( )[12 3]( )•••( )[2 4 5]( )■ 





(1)[1 ]■••[ 2](2 )•■•[ 3](3)^ ^(2)[2 ]-(4)[4 ]■■■[ 5](5) • 
copy of Z' \ \ I copy of Z' 

/ special blocks 



Figure 1: Construction of Z^(m) from Z* and many copies of Z' . Two special blocks of 
Z* are depicted. In the left one, the symbol 1 makes its last occurrence, and symbols 
2, 3 make their first occurrence. In the right block, symbols 2 and 4 make their last 
occurrence, and symbol 5 makes its first occurrence. 



Suppose for a contradiction that Zfi{m) contains an alternation ababa. The symbols 
a and b cannot come from the same copy of Z' , by induction, and they cannot come 
from different copies of Z', since they would not alternate at all. 

Further, a and b cannot both come from Z*: By the induction assumption, Z* 
contains no forbidden alternation. And the duplications of symbols a ^ aa cannot 
create a forbidden alternation, since the two a's end up being adjacent in Zrf(m). 

Next, suppose that a comes from a copy of Z' and b comes from Z* . Then this copy 
of Z' received two non-adjacent Vs. But this is impossible by construction: Our copy 
of Z' received symbols from a single special block of Z*, which contained at most one 
b. This b was duplicated into two adjacent copies bb. 

Finally, suppose that a comes from Z* and b comes from a copy of Z' . Then this 
copy of Z' received an a that is neither the first nor the last a in Z* . This is also a 
contradiction. 

Remark 6.1: The above construction shares some similarities with an earlier construc- 
tion by Komjath [TO]. 

6.1 Analysis 

Recall that Sd{m) denotes the number of special blocks in Zd{m). We define a few other 
quantities related to Zrf(m): 

• Nfi{m) = \\Zd{m)\\ denotes the number of distinct symbols in Zd{m). 

• Ld{m) = \L(i{rn)\ denotes the length of Zd{m). 

• M(i{m) denotes the total number of blocks (regular and special) in Zd{m). 

• We let Xii{rn) = M^^rn) / Sdipi). Thus, Xdirn)^"^ is the fraction of blocks in Zdijn) 
that are special. 
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• We let Vdim) 
Note that 



Lii{m)/Md{m) denote the average block length in Zrf(m). 



Nd{m) 
Ld{m) 



1 



mSd{m), 



i2d+l)Nd{m) 



d + 



mSd{m). 



(32) 
(33) 



Equation (32) follows from the fact that each symbol appears in two special blocks, and 
each special block contains m symbols. Equation (33) follows from the fact that each 
symbol appears 2d + 1 times in Zd{m). 

Theorem 1.6 follows from the following facts: 

Lemma 6.2. The quantity Nd{m) experiences Ackermann-like growth. Specifically, 
there exists a small absolute constant c such that 



Adim) < Nd{m) < Ad{m + c) 

for all d> 3 and all m>2. 

We also have Xd{m) <2d+\ and Vd{m) > m/2 for all d and all m. 

Let us first see how this lemma implies Theorem |1.6[ 



(34) 



Proof of Theorem Diagonalize by taking the sequences Z*^ = Zd[d) for d = 1,2,3,.... 



Let iV* = Nd{d), L* = Ld{d), and V* = Vd{d). 



By (|34|) and (g we have N* < Ad{d + c) < Ad{A{d + 1)) = A{d + 2). Thus, 

A{d) <N^< A{d + 2) (35) 

for all d>4. Thus, by (|7|, 



for d> 4, and so, by (33), 



a{N*,) -2<d< a{Nl 



Ll>2Nl.a{Nl)-0{Nl). 



(36) 



The sequences Z*^ are not necessarily Davenport-Schinzel sequences, since they might 
have adjacent repeated symbols. Therefore, create sequences Z'^ by removing adjacent 
repetitions from ZJ. Since we delete at most one symbol per block, the length of 
decreases by at most a 1/V^* fraction. But by Lemma 6.2 this ratio tends to zero with 
d (this is why we diagonalized) . Specifically, the length of Z'^ is 



L'd > I 1 



>L*d\l 



d 



>2Nl-a{N*,)-0{N*,). 



(37) 



We have just proven that A3(n) > 2na(n) — 0{n) for n of the form n = N^. We just 
have to interpolate to intermediate values of n. Given n, let d = d{n) be the unique 
integer such that 

m < iVdVi <n< N*,+2- 
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It follows, by applying (36) twice, that 



a{n) < a{N2^^) <d + A< a{N2) + 4. 
Also, by the rapid growth of N*^ in d, we certainly have 



N*, < JN* < 



(38) 



(39) 



for d > 4. 

We now concatenate many copies of Z'^ with disjoint sets of symbols, making sure we 
do not have more than n distinct symbols altogether. Specifically, we let t = [n/N^\, 
and we let Z"{n) be a concatenation of t copies of Z'^ with disjoint sets of symbols. 

By (pTl), (l38|, and (pOl, it follows that the length of Z"{n) is 



L"(n) = tL', > 



n 



1 {2NI . a{Nl) - 0{NI)) = 2na{n) - 0{n) 



Since A3(n) > L"{n), the bound follows. 
All that remains is to prove Lemma |6. 2 



□ 



Proof of Lemma \K^ The quantity Sd{m) is given recursively by 



Slim) = 2; 
Sd{l) = 2; 

Sd{rn) = fg = Sd{m - l)Sd-i (5'd(m - 1)) , 



for d, m > 2. 



(40) 



In particular, we have 52(m) = 2"* = A2{m), and 5^(2) = 2*^. 

It is not hard to show (see Appendix [Cj that there exists a small constant cq such 
that 

Ad{m) < Sd{m) < Ad{m + cq) (41) 



for all d > 2 and all m. Then, by (32 ) we have, for d > 3, m > 2, 

Sd{m) < Nd{m) < Sd{mf < Sd{m + 1), 



so ( 34 ) follows with c = cq + 1 . 
Regarding Md{m), we have 

Mi(m) = 3; 

Md{l) = 2d + 3, ford>2, 

counting the empty blocks at the ends of Zd{l). And for d, m > 2, we have 

Md{m) = gMd{m - 1) + Md-i{f) - g 

= Sd-i {Sd{m - 1)) {Md{m - 1) - l) + Md-i {Sd{m - 1)) 



(42) 
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(since the g special blocks of Z* disappear). In particular, we have M2{m) = 2'^^'^ — 1, 
and Md{2) = 2'^+^d-l. 

Let us now examine Xrf(m) = M(i{m) / Sd{m). We have 



Xi{m) 


= 3/2, 


X2{m) 


= 4-2-"", 




= (i + 3/2. 


Xd{2) 


= 2d -2-'^ 



In general, dividing (42) by (40) 



Y( ^ Y( , X,_i(g,(m-1))-1 

Xd[rn) = Xrf(m - 1 + — . 

bdim - 1) 



(43) 



We now prove by induction that Xdijn) < 2(i + 1 for all d and m. The claim has 



been verified for d<2 and for m < 2, so assume d, m > 3. By (43) and using induction 
on d, we have 

2d — 2 

Xdira) < Xd{m - 1) + 



so 



Sd{m- !)■ 



Xd[m) < Xdi2) + {2d -2)J2 Sdim)-' = 2d - 2"^ + {2d - 2) ^ 5d( 



ni 



m=2 



m=2 



It is easily checked that, for d>3, 



J2 Sd{m)-' < 2Sdi2)-' = 2"-" < 



1 



m=2 



2d -2 



It follows that Xd{m) < 2d + 1, as desired. 



Finally, let us consider Vd{m). By (33) we have 



m m 
Xd{m) - 2- 



□ 



Remark 6.3: The coefficient 2 in our bound for \2,{n) comes from the fact that each 
symbol appears roughly 2d times in Zd{m). In previous constructions [201 [101 US] each 
symbol appears only d it 0(1) times in the equivalent sequence. Sharir and Agarwal |16| 
lost an additional factor of 2 in the interpolation step; we avoided this loss in the proof 



of Theorem 1.6 by letting Z"{n) consist of many copes of Z'^, instead of using Z'^j^^ 
(which would have been a more obvious choice). 
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6.2 Lower bound for the number of symbols in almost-DS sequences 
of order 3 



In Section |4] we introduced the notion of almost-DS sequences. We derived an upper 
bound on the maximum number n|(m) of distinct symbols of an ADS^(m)-sequence, 
and we used this upper bound to bound Xs{n). 

But the problem of ADS|(m-)-sequences is interesting in its own right, so one might 
naturally wonder about matching lower bounds for n|(m). 

It turns out that the construction Zd{m) described in this section also provides a 
roughly-matching lower bound for n^(x). We just have to change our point of view: 
Instead of taking a diagonal (namely, we take the rows of the construction 

(meaning, Zd^ni) for fixed d). 

Lemma 6.4. For every fixed d>2 we have n|^^-|^(x) = 1^(^x0^(2;)). 

Proof. For every m > 1, the sequence Zd{m) is an ADS2^_|_]^(xm)-sequence for Xm = 
M(i{m). Let = N^(m) be the number of distinct symbols in Z^{m). 



By the definition of Xd^m), and by applying Lemma 6.2 and then (41 ), we have 

Xm = Md{m) = Xd{m)Sd{m) < {2d + l)Sd{m) 

<{2d+l)Adim + co)<Adim + co + l). 



Thus, by ^ we have tti ^ ci^(x^ ) - Co - 1. Therefore, by (p3|, applying Lemma ^ 



again, and applying (44), 



We interpolate to intermediate values of x (for Xm < x < Xm+i) as we did above, in 
Section 16. li □ 



Thus, for odd d the bounds for I\^{m) are quite tight (they leave a multiplicative 
gap of 0{d)). For even d the bounds are not so tight — they are obtained by applying 
n3+i(m) <n3(m) <n3_,(m) 



Lemma 6.4 automatically yields a lower bound for nV,4,fc("i-): A sequence that does 
not contain ababa cannot contain an (r, 4)-formation for any r > 2; further, as Adamec, 
Klazar, and Valtr [1^ showed, an r-sparse, u-free sequence can be made r'-sparse for 
r' > r at the cost of shrinking the sequence by at most a constant factor. 



7 The lower-bound construction for s > 4 even 

In this section we present a construction that achieves the lower bounds ([2]). This is a 
simpler variant of the construction of Agarwal, Sharir, and Shor [21 [16] that achieves 
the same bounds. 

We first construct a family of sequences S^{m) for s > 2 even, k > 0, and m > 1. 
For all s > 4, m > 2, the sequences Sl{m) are Davenport-Schinzel sequences of order s. 
The sequences Sl{m) are highly regular; they satisfy the following properties: 
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• SKm) is a concatenation of blocks of length m, where each block contains m 
distinct symbols. (For s = 2 or m = 1 there are adjacent repeated symbols at the 
interface between blocks, but only in these cases.) 

• S^{m) does not contain any forbidden alternation abab . . . of length s + 2, for 
any distinct symbols a ^ b. Thus, for s > 4, m > 2, the sequence Sl{m) is a 
Davenport-Schinzel sequence of order s. 

• All symbols in Sf.{m) occur with the same multiplicity ^s{k), which depends only 
on s and k. Further, for s > 4 each symbol in Sf.{m) makes all its appearances in 
the same position within the blocks, and no two symbols a, b appear together in 
more than one block. 

7.1 The construction 

For s = 2, the sequences 5'|(m) are given (independently of k) by 

Sl{m) = 12 . . . m m . . . 21. 

S'^{m) consists of two blocks of length m, and each symbol occurs with multiplicity 
fi2{k) = 2. Clearly, S'^{m) contains no forbidden alternation abab. 

The construction for general s > 4 is as follows. For /c = 0, we let SQ^m) consist of 
a single block of length m: 

S^{m) = 12... m. (45) 

Thus, fisiO) = 1. 

For general A; > 1, we proceed as follows. The sequence Sf.{l) consists of 

fis{k) = iJs~2{k-l)l^s{k-l) (46) 

copies of the symbol 1, each forming by itself a block of length one. Equation ( |46| ), 
together with the bounding cases ^2{k) = 2 and /Us(0) = 1 for s > 4, gives the recursive 
definition of Hsik). 

For m > 2, the sequence S^{m) is constructed inductively on the lexicographic order 
of the triples {s,k,m), using three previously created sequences as components. 

The first sequence is S' = S^{m — 1); note that S' contains blocks of length m — 1. 
Let / be the number of blocks in S' . 

The second sequence is 5 = S^Ziif)- Thus, S contains blocks of length /. Let 
g = 1 1 5*1 1 be the number of distinct symbols in S. 

The third and final sequence is S* = Sl_i{g). Thus, S* contains blocks of length g. 

Transform the sequence S* into a sequence S* by replacing each block in 5* by a 
copy of S with the same set of g symbols, making their first appearances in the same 
order as in the replaced block. Note that S* contains blocks of length /. Further, the 
multiplicity of each symbol in S* is the product of the symbol multiplicities in S and 
S*; by incduction this multiplicity equals 

Hs-2ik - l)iis{k - 1) = Ms(fc). 
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Figure 2: The recursive construction of Sf,{m). The sequence S* is the result of replacing 
each block of S* by a copy of S. Each block of S* is then distributed among the / blocks 
of a single copy of S' . 

Let h be the number of blocks in S* . 

Now, create h copies of S", each copy using "fresh" symbols which do not occur in 
S* nor in any preceding copy of S", and concatenate them into a sequence S" . Note 
that S" contains fh blocks of length m — 1, while p*] = fh. 

Insert each symbol of S* in order at the end of each block of 5". Thus, each 
component sequence S' in S" , containing / blocks, receives the / distinct symbols of a 
block in S*. The resulting sequence is the desired 5|(m). Note that it contains blocks of 
length m, and, by induction and construction, each symbol in it has multiplicity Hs{k). 
See Figure [2} 

Letting t = s/2 — 1, we have 



if we take s to be a constant. 

7.2 Correctness of the construction 

We now prove that, for s > 4, m > 2, the sequences 5'|(m) are indeed Davenport- 
Schinzel sequences of order s. 

Let us first recall some important properties of the construction: 

• The last symbol in each block of Sf,{m) comes from S* (which has the same set of 
symbols as S*), while every other symbol in S'Km) comes from a copy of S'. 

• The copies of S' have pairwise disjoint sets of symbols, which are also disjoint from 
the set of symbols of S* . 

• When merging S" and S* to form S'^(m), each copy of 5' in S" receives the / 
distinct symbols of a block of S*. 



(47) 
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The following lemma is easily proven by induction using the above properties: 
Lemma 7.1. The sequence Sl{m) satisfies the following properties: 

1. For s > 4, each symbol in the sequence makes all its appearances in the same 
position within the blocks. 

2. For s > 4, m>2, there are no adjacent repeated symbols. 

3. For s > 4, no two symbols of Sf.{m) appear together in more than one block. 

For each symbol a in Sl{m), call the depth of a the position within the blocks in 
which a always appears in S'^(m). This notion is well-defined by the above lemma. 
Thus, the symbols that come from copies of S' have depth between 1 and m — 1, while 
the symbols that come from S* have depth m. 

The following Lemma is also pretty straightforward: 

Lemma 7.2. Symbols at different depths in S^{m) make alternations of length at most 
5. 

Proof. By induction. The claim is clearly true ifs = 2, fc = 0, orm = l. Thus, let 
s > 4, A: > 1, and m > 2. Let a and b be two symbols at different depths in 5'|(m). 

If both a and b have depth at most m — 1, then they either come from the same 
copy of S' , in which case the claim follows by induction, or else they come from different 
copies of S' , in which case they do not alternate at all. 

Thus, suppose one symbol, say a, has depth m (so it comes from S*), while the other 
symbol, b, has depth at most m — 1 (so it comes from a copy of S'). 

The copy of S' to which b belongs receives at most one a from S* . In the worst case, 
this a is surrounded by 6's from our copy of S", and this copy of S' is in turn surrounded 
by other a's from S* . Thus the longest alternation we can get is ababa. □ 

The main issue is to show that 5'^(m) contains no forbidden alternating subsequence 
of length s + 2. For this, we prove by induction that Sf,{m) satisfies a stronger property. 

Lemma 7.3. The sequence Sl{m) satisfies the following properties: 

1. S^{m) contains no forbidden alternation abab . . . of length s + 2. 

2. Furthermore, if each block B in Sf,{m) is replaced by a sequence T{B) on the same 
set of symbols as B, such that T{B) contains no alternation abab . . . of length s, 
and such that the symbols in T{B) make their first appearances in the same order 
as they did in B, then the resulting sequence still contains no forbidden alternation 
of length s + 2. 

Proof. Again by induction. Both properties clearly hold if s = 2, A; = 0, or m = 1, so 
let s > 4, /c > 1, and m>2. 

Assume by induction that Properties [l] and [2] hold for the sequences S", S, and S* 
from which S'^{m) is built. We want to show that these properties hold for S^{m) itself. 
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Figure 3: The left figure shows the case of s even. For a forbidden alternation to occur, 
a pair of symbols a, 6 in a common block must be replaced by an alternation of length at 
most s — 1, and extended to length s + 2 by at least three more symbols a, 5, according 
to one of four possible cases. In each case we get a contradiction. The right figure shows 
the case of s odd. Here the argument fails, because case (ii) fails to yield a contradiction. 



We start with Property [Tj Suppose for a contradiction that S^{m) contains a forbid- 
den alternation abab ... or baba ... of length s + 2. By Lemma |7.2[ a and b must have 
the same depth (since s + 2 > 6) . 

If a and b have depth at most m — 1, then they must belong to the same copy of 5", 
or else they would not alternate at all. But this contradicts our inductive assumption 
on S'. 

And if a and b have depth m and come from S* , then S* itself contains a forbidden 
alternation. But S* is obtained from S* via block replacements, exactly as described in 
Property |2] Thus, the inductive assumption on S* is contradicted. 

In conclusion, Sf,{m) cannot contain an alternation of length s + 2, so it satisfies 
Property [1} 

Now we show that S'^{m) satisfies Property |2j Suppose for a contradiction that, 
after performing a certain set of block replacements in Sf. (m) , we do get an alternation 
abab ... or baba ... of length s + 2. 

For this to happen, a and b must have appeared together in some block B of 5'|(m). 
(By Lemma 7.1 they do not appear together in more than one block.) Say that a 
appeared before b in this block. This block was replaced, in the worst case, by a 
sequence containing an alternation abab. . . of length s — 1. (Without loss of generality 
we may assume the alternation starts with an a, since the block replacement preserves 
the order of first appearances of the symbols.) 

This alternation is extended to length s + 2 by at least three more instances of a 
and b before or after the block B, according to one of four possible cases, as depicted in 
Figure [I (left). 

To see why none of these cases can occur, consider again where the symbols a and b 
came from. If a and b came from the same copy of S' , then the same block replacement 
in S' would also have generated a forbidden alternation of length s-|-2. This contradicts 
our inductive assumption for S' . 

Further, a and b could not have come from different copies of S', since then they 
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would not lie together in the same block (and they would not alternate at all). For a 
similar reason, they cannot both come from S* . 

Thus, one symbol — specifically, a — must originate from a copy of S' , and the other 
one — namely, b — must originate from S*. But all the other instances of a in S^(m), to 
the left or right of our block B, also come from the same copy of S' . A case analysis 
shows that in each of the four cases shown in Figure [s] (left), this copy of S' received 
two copies of b from S* . (In cases (i) and (ii) there are two 6's surrounded by a's, and 
in cases (iii) and (iv) there is a 6 surrounded by a's, plus another b lying in the same 
block as an a.) This is impossible according to our construction. □ 

Remark 7.4: Unfortunately, the above argument depends crucially on s being even. 
If we try to make the same argument with s odd, we get the four cases illustrated in 
Figure [s] (right), and in case (ii) we fail to get a contradiction — we cannot find two 
instances of b sent to the same copy of S' . 

7.3 Analysis 

Given a fixed even number s > 4, take the sequences Sf,{2), for k = 0,1,2, .. .. These 
are Davenport-Schinzel sequences of order s, in which the multiplicity of the symbols, 
IJ-s{k), goes to infinity. Thus, the length of these sequences grows superlinearly in the 
number of symbols. We want to derive the exact relation between these two quantities. 
For this purpose, we derive an upper bound on the number of distinct symbols in Sf.{2). 

Let N^{m) = \\S^{m)\\ denote the number of distinct symbols in 5^(m), and let 
F^{m) be the number of blocks in Sf,{m). Then, 

\Si{m)\ = lis[k)Ni[m) = mF^{m). (48) 

The quantities N^{m) are initialized by 

Nl{m) = m; 
NQ{m) = m; 
N^l) = 1. 

To get a recurrence relation for the general case, we analyze the recursive construction 
of S^{m). Using the notation there, we have 

f = F^{m-iy, 

h = FU{9) ■ F^zlU); 

mim)=NUi9) + h-mim-l). 
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Thus, applying (48) three times and then (46), 



fisik - l)N'^_^ig) /x,_2(fc-l)-5 {m-l)-f 



y 

m-NU[K-l{mm-l))). 



f 



Since /^s(fc) < 2 and m > 1, by (48) we have 



(49) 



so 



N^{m) < m . NU{Krl{^''' mm - 1))). 



We now simphfy the analysis by getting rid of the dependence on s in the last 
inequality. For this, we define an Ackermann-like hierarchy of functions ^^(m) for 
A; > 0, m > 1, by 

Ao(m) = m; 



and 



Ak{m) 



1, 



if m = 1; 
otherwise; 



for A; > 1 (compare to (|4j)). It follows by induction that 

Nl{m) < Ak{m) 
for all s. A;, and m. In Appendix [C] we prove that 

Ak{m) < Ak-\-i{2m + 4) for all A; > 2 and all m. 



(50) 



(51) 



Now let us come back to the sequences with which we started this discussion. Let 
Tfc = S'|(2) for A; = 0, 1, 2, . . ., and let = ||rfc||. Then, applying (|50]), ([5l|, and ([S]), 

nk = mi'^)<M2)<Ak+i{8)<Ak+i{A{k + 2))=A{k + 3). 



Therefore, k > a{nk) — 3. Substituting into ( |48| ) applying (47), and letting t = s/2 — 1, 

\Tk\ = ■ fisik) > ■ fisHuk) - 3) > nfc • 2(iA!)"(n.)'-o(-(n.)*-^) . 

We have thus achieved the desired lower bound on As(n) for n of the form n = ri]^. 
As in Section [6| interpolating to intermediate values of n (for < n < n^+i) is 
straightforward, and we obtain the desired bound for all n. 
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7.4 Advantages over the previous construction 

The construction we just presented follows the same basic idea as the previous construc- 
tion of Agarwal et al. [21 [16], but it has the following advantages: 

• In our construction each block is just a sequence of m distinct symbols. In the 
previous construction each block (there called a "fan") is of the form 12 . . . m . . . 21. 

• In our construction all symbols have the same exact multiplicity. This greatly 
simplifies calculations. 

• In our construction there are no adjacent repeated symbols at the interface between 
blocks. (Removing these adjacent repetitions in the previous construction does not 
present any serious problem, but they constitute a small aesthetic blemish.) 

• The previous construction involves some "tiny" duplications of symbols, which our 
construction does not have. These duplications are not the cause of the asymptotic 
growth (and indeed, our construction works fine without them). This is a potential 
source of confusion, especially since these "tiny" duplications are also present in 
the lower-bound construction for order-3 sequences, and in that case they are 
critical. 



7.5 Lower bounds for the number of symbols in almost-DS sequences 
of even order s > 4 

As was the case with the construction of order 3, the construction described in this 
section yields lower bounds for n|(m), for s > 4 even. Again, the idea is to look at the 
rows of the construction, namely at S^{m) for fixed s and k. 

Lemma 7.5. For every fixed even s > 4 and every k > 4 we have 

II*(x) > xak{x) for all large enough x, 
for some fj, asymptotically of the form 

where t = s/2 — 1. Moreover, these lower hounds can he achieved hy actual Davenport- 
Schinzel sequences. 



The proof is similar to the proof of Lemma 6.4 , though somewhat simpler, since the 



blocks in S^{m) have uniform length. We omit the details. 



As before. Lemma 7.5 automatically yields lower bounds for n'.r,s,fc(?7T-) for odd s > 5. 
It is an open problem whether the lower bounds for the case s = 3 shown above (Sec- 
tion 6.2 ) can be achieved with actual Davenport-Schinzel sequences (without adjacent 



repeated symbols), as was the case here. 
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8 Conclusion and open problems 



The bounds for As(n) are now tight for every even s. Unfortunately, for odd s > 5 the 
problem is still not completely solved. We believe the new upper bounds for odd s are 
the true bounds, simply by analogy to the interval-chain bounds. But the construction 
that gives the lower bounds does not seem to work when s is odd. 

Are there other problems that, like interval chains and almost-DS sequences, satisfy 



recurrences like Recurrence 4.7 and Recurrence 4.11 ^' If so, it would be interesting to 
find more examples of such problems. 

The reason we can unambiguously talk about the coefficient that multiplies a(n) 
(e.g., in Theorems 1.2 and 1.6), despite the fact that there are several different versions 



of a(n) in the literature, is that all these versions differ from one another by at most an 
additive constant. Thus, the coefficient multiplying a(n) is not affected. On the other 
hand, one cannot talk about the leading coefficient in A4(n) = 0(n • 2"*^"^), for example, 
unless a standard definition of a{n) is agreed upon. 

Can our lower-bound construction for A3(n) (Section [6| be realized as the lower 
envelope of segments in the plane? If so, it would yield a factor-of-2 improvement for 
this problem as well. 
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A Proof of Klazar's Lemma 



4.9 



For completeness, we include here the proof of Klazer's Lemma 4.9 Recall that the 



claim is that A3(n) < V'3(l + 2n/^, n) + 3ni, where i < n is a free parameter. 



Proof of Lemma 4-9 ■ Let S" be a maximum-length Davenport-Schinzel sequence of order 
3 on n distinct symbols. Thus, IS"! = A3(n). Call an occurrence of a symbol a in 5 a 
terminal occurrence if it is the first or last occurrence of a in S. 

Partition S into blocks S = S1S2S3 . . . Sm, where each Si starts with a terminal 
occurrence and contains exactly £ terminal occurrences (except for Sm, which might 
contain fewer terminal occurrences). Since S contains 2n terminal occurrences, the 
number of blocks is m = \2n/£'\ < 1 + 2n/i. 

For every block Si and every symbol o, let ni(a) be the number of occurrences of a 
in Si. Recall that these occurrences must be nonadjacent. If Si contains the first or last 
occurrence of a in S, we say that a is terminal in Si] otherwise, a is nonterminal in Si. 

Let Aj be the set of symbols that appear in Si. Let be the subset of these symbols 
which are terminal in Si, and let A^' be the subset of those which are nonterminal. 
Clearly, 

= \\Si\\ + ^ {ni{a) - 1). 

aeAi 

We claim that nj(a) < i for all a G Aj. Indeed, suppose for a contradiction that 
ni{a) > £ + 1 for some a G Aj. Then the occurrences of a in Si define £ interior-disjoint, 
nonempty intervals. But Si contains at most £ terminal occurrences of symbols, one of 
which is the first symbol of Si. Therefore, one of the above-mentioned intervals must 
be free of terminal occurrences, and so it contains a symbol b which also appears both 
before and after the interval. Thus, S contains babab, which is a contradiction. 

For a similar reason. Si cannot contain the pattern aba for any a,b £ A". Therefore, 
the nonterminal symbols in Si do not intermingle at all (meaning, for every a,b G A^', all 
occurrences of a appear before all occurrences of b or vice versa). Therefore, the symbols 
which are nonterminal in Si define "^a&A" (™i(^)~l) interior-disjoint, nonempty intervals 
of the form a . . .a in Si. On the other hand, the number of such intervals cannot be 
larger than £ — 1 (by an argument similar to the one above). Therefore, 

\Si\ = + ^ {ni{a) -l)+Y. (^^(«) - 1) 

< ||5,|| + (^-l)|A^| + (^-l) 

< ||5,|| + £{£ - 1) + (^ - 1) = \\Si\\ +£^-1. 

Now, define a subsequence S' of S by taking just the first occurrence of each symbol 
in each Si. Then, \S'\ = Yl^i W^iWi is composed of m blocks, each of distinct 

symbols. S' might still contain adjacent repeated symbols at the interface between 
blocks, but these can be eliminated by deleting at most m — 1 <2n/£ symbols. We get 
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a Davenport-Schinzel sequence S" which satisfies \S"\ < ips^m,!!,), and thus 

m m 

A3(n) = 1^1 = ^ \Si\ < - 1) + ^ \\Si\\ 



i=l i=l 
/l2 



< (1 + 2n/fi){t - 1) + V'3("^, n) + 2n/t 

< ^3(1 +2n/£,n) + 3n^. □ 

B On the asymptotic growth of some recurrent quantities 

A recurrent feature in this paper are two-parameter quantities given roughly by Cs^k ~ 
Cs-2,kCs,k-i: with base cases C^^k = Q{k) and C4^fc = 0(2^^). (Specifically, we have the 
quantities Pg^k and Qs^k in Section[3j and Rs{d) in Section|4j See also /is(fc) in Section[7| 
There are also similar quantities in In this appendix we give a generic analysis of 
the asymptotic growth of such quantities (as a function of k for s fixed) . 

Lemma B.l. Let Cg^k be defined recursively for s > 3, k > 1 by 

C3,k = e{ky, 

Cs,k = Cs_2,kCs,k-i + aCs-i,k, for s>5, k> 2; 

for some implicit constants for C-^^k ^n-d C^^kj some nonnegative constant a = a{s), and 
some initial conditions Cs i • Then for every fixed s >3 we have 



C. 



s,k 



'2(iA!)fc'±o(fc-i)^ s even; 
2(iA!)fc*iog,fc±o(fc*) g^^^ 



where t = [{s - 2)/2j. 



Remark B.2: The coefficient a of Cg-i^k is nonnegative in all applications of Lemma B.l 
we need, so we do not consider the case where a is negative. And if the recurrence 
equation for Cs^k had other lower-order terms such as a'Cs-2,k or a"Cs^k-i (for a', a" 
positive or negative), they could be handled quite easily, and they would not affect the 
asymptotic growth of fc; therefore, we omit them for simplicity. 



Proof of Lemma B.l Let s > 5, and assume by induction that Cg^i k-, Cs-2 k have the 



claimed growth in k. Let Cs^k = log2 Cs,k- Then, 

Cs,k > Cs-2,k + Cs h-1, 



so 



Cs,k > ^ Cs-2,': 



i=2 
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Using the assumed growth for Cs-2,k^ and bounding the resulting sum by an integral, we 
conclude that 

^ f^A;*-0(A;*-i), seven; 
^"''^ ~ ' ^A;*log2A:-0(fc*), s odd; 

implying the lower bound for Cs^k- 

From this lower bound for Cs^k it follows that Cs,k-i > aC's-i,fc for all large enough 
A;, and therefore, 

Cs,k < 2Cs_2,fcC's,fc-i for ah large enough k. 
Thus, the upper bound for Cs,k follows similarly. □ 

C Comparing Acker mann- like functions 

In this appendix we present a general technique for proving that variants of the Acker- 
mann hierarchy exhibit equivalent rates of growth. We first give the lemma on which 
the technique is based, and then we illustrate the technique by proving that the function 



Ak{m) of Section 7.3 satisfies Ak{m) < Ak+i{2m + 4). 

This is basically the same technique as in Appendix B of ^i, but rephrased so as 
to deal with rapidly growing functions instead of their slowly growing inverses. Our 
technique here is also slightly more general than the one in [ij . 

We consider the following general setting. Suppose F{n) and G{n) are nondecreasing 
functions that satisfy F(n),G{n) > n for all n. Define functions F°(n), G°{n) by 
F°(n) = F('^)(Fo), G°(n) = G(")(Go), with some initial conditions Fq, Gq. (Recah that 
Z*^") denotes the n-fold composition of /.) 

We want to prove that F°{n) < G°{dn + c) for some constants d and c. The following 
lemma gives a sufficient condition for this. 

Lemma C.l. Let F(n), G{n), F°{n), G°{n) he functions as given above. Suppose there 
exists an integer d and a function 5{n) such that 

n < 5{n), (52) 
5{F{n)) < G^'^\5{n)), (53) 

for all n > 1. Then F°{n) < G°{dn + c) for a constant c large enough that 

6{Fo) < G°{c). (54) 



Proof. Applying (52), then l\53h n times, and then <\54h, 



F°{n) = f('^)(Fo) < (^(f(")(Fo)) < G('^")(5(Fo)) 

< G('^")(G°(c)) = G('^"+^)(Go) = G°{dn + c). □ 

Now let us apply this technique to the task at hand. 
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Lemma C.2. Let Ak{m) be given by 

Ao^m) = m, for m > 1; 

and 



1, 



[m ■ Ak-i(^Ak-i(2'^ Ak{m-1) 
for k > 1. Then Ai^{m) < ylfc_|_i(2m + 4) for all k >2 and all m. 
Proof. We start by noting that 

li(m) = 22^"-^! <2™'. 



if m = 1; 
otherwise: 



(55) 



(56) 



Unfortunately the recurrence (55 ) does not fit the general setting of Lemma C.l because 



of the factor m in it. But it is not hard to show that 



Akim) < Ak^i(^Ak-i(2^ ' Akim - for m > 2, 

so we will use this recurrence instead (the penalty we pay is minimal). We are going to 



apply Lemma C.l with d = 2, with 

F{m) = Ak-i(^Ak-i(2^''m) ) , 
G(m) = Ak{m), 

and with the initial conditions Fq = Go = 1. Thus, 

F°{m) > Ife(m), 
G°{m) = Ak+i{m). 



(57) 



Let us start with the case k = 2. In this case we have, by (56), 



F(m)=li(li(16m))2<22"''" ^\ 
G{m) = 2"^. 

Then an appropriate choice of 5 is 5{rn) = 600m'^, since 



6{F{m)) <5[2 



0512m^ + l 



600 • 23-2^"™'+^ < 22'"°'"' 



G{G{5{m))) 



for all m > 1, and so 5 satisfies (53). Further, it is enough to take c = 4 in (54), since 



G°(4) = 22' >515 = 5(Fo). 
We conclude that A2{m) < A^{2m + 4). 



47 



Now we deal with the general case A; > 3. Suppose by induction that Ak^i(m) < 
Ak{2m + 4). Substituting this into (57), 

F{m) <Ak(2Ak(2'^'+^m + 4^ +4)^ 

Now it is easy to see that taking 5(m) = 2'^''^^'m + 5 guarantees that 

6{F{m)) < Ak{Ak{6{m))) = G{G{6{m))) 
for all m > 1. Furthermore, we have 

G%4) = Ak+i{4) > 2^'+' + 5 = 6{Fo). 
We conclude that Ak{m) < Ak+i{2m + 4), as desired. □ 
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